(57) Abstract



The present invention concerns proteins encoded by a family of genes, termed here HDx-related genes, which are involved in the
control of chromatin structure and, thus, in transcription and translation. The present invention makes available compositions and methods
that can be utilized, for example, to control cell proliferation and differentiation in vitro and in vivo.

Histone Deacetylases, and Uses Related Thereto

Background of the Invention
The organization of regulatory DNA elements into precise chromatin structures is
important for both DNA replication and transcription in vivo (Lee et al. (1993) Cell
72:73-84; Felsenfeld (1992) Nature 355:219). In eukaryotic cells, nuclear DNA exists as
a hierarchy of chromatin structures, resulting in the compaction of nuclear DNA about
10,000-fold (Davie and Hendzel (1994) J. Cell. Biochem. 55:98). The repeating
structural unit in the extended 10 nm fibre form of chromatin is the nucleosome (van
Holde (1988) Chromatin. New York: Springer-Verlag). The nucleosome consists of 146
bp of DNA wrapped around a protein core of the histones H2A, H2B, H3 and H4,
known as the core histones. These histones are arranged as an (H3-H4)2 tetramer and
two H2A-H2B dimers positioned on each face of the tetramer. The DNA joining the
nucleosomes is called linker DNA; it is to the linker DNA that the H1, or linker,
histones bind. The 10 nm fibre is compacted further into the 30 nm fibre. Linker
histones and amino-terminal regions ("tails") of the core histones maintain the higher
order folding of chromatin (Garcia Ramirez et al. (1992) J. Biol. Chem. 267:19587). This
chromatin structure must be relaxed when DNA is transcribed or translated.

Histones of the nucleosome core particle are subject to reversible acetylation at
the ε-amino group of lysines present in their amino terminus (Csordas et al. (1990)
Biochem. J. 265:23-38). Transcriptionally silent regions of the genome are enriched in
underacetylated histone H4 (Turner (1993) Cell 75:5-8), and histone hyperacetylation
facilitates the ability of transcription factor TFIIIA to bind to chromatin templates (Lee et
al. (1993) Cell 72:73-84). Recent genetic, biochemical and immunological approaches
have provided substantial evidence indicating that histones associated with actively
transcribed genes are more highly acetylated than those from nontranscribed regions.
While not wishing to be bound by any particular theory, histone acetylation may influence
transcription at several stages, for example, by causing transcription factors to bind, or by
inducing structural transitions in chromatin, or by facilitating histone displacement and
repositioning during polymerase elongation.

The acetylation and deacetylation are catalyzed by specific enzymes, histone
acetyltransferase and deacetylase, respectively, and the net level of the acetylation is
controlled by the equilibrium between these enzymes. The steady state level of
acetylation and the rates at which acetate groups are turned over vary both between and
within different cell types, with half-lives that vary from a few minutes to several hours.
Although a histone acetyltransferase gene (HAT1) has been identified in yeast (Kleff et
al. (1995) J. Biol. Chem. 270:24674-24677), the molecular entities responsible for
histone deacetylation were heretofore unknown in the art.

The identification of the mechanism by which histones are deacetylated would be 
of great benefit in the control of gene transcription and the cell cycle. 

Summary of the Invention 

The present invention relates to the discovery of a novel family of genes, and gene
products, expressed in mammals, which genes are referred to hereinafter as the "histone
deacetylase" genes or "HDx" gene family, the products of which are referred to as
histone deacetylases or HDx proteins.

In general, the invention features isolated HDx polypeptides, preferably
substantially pure preparations of one or more of the subject HDx polypeptides. The
invention also provides recombinantly produced HDx polypeptides. In preferred
embodiments the polypeptide has a biological activity including an ability to deacetylate
an acetylated histone substrate, preferably a substrate analog of histone H3 and/or H4. In
other embodiments the HDx polypeptides of the present invention bind to trapoxin or to
trichostatin, such binding resulting in the inhibition of a deacetylase activity of the HDx
polypeptide. However, HDx polypeptides which specifically antagonize such activities,
such as may be provided by dominant negative mutants, are also specifically
contemplated.

The HDx polypeptides disclosed herein are capable of modulating proliferation,
survival and/or differentiation of cells, because of their ability to alter chromatin structure
by deacetylating histones such as H3 or H4. Moreover, in preferred embodiments, the
subject HDx proteins have the ability to modulate cell growth by influencing cell cycle
progression or to modulate gene transcription.

In one embodiment, the polypeptide is identical with or homologous to an HDx
protein. Exemplary HDx polypeptides include amino acid sequences represented in any
one of SEQ ID Nos: 5-8. Related members of the HDx family are also contemplated; for
instance, an HDx polypeptide preferably has an amino acid sequence at least 85%
homologous to a polypeptide represented by one or more of the polypeptides designated
SEQ ID Nos: 5-8, though polypeptides with higher sequence homologies of, for example,
88%, 90% and 95% are also contemplated. In one embodiment, the HDx polypeptide is
encoded by a nucleic acid which hybridizes under stringent conditions with a nucleic acid
sequence represented in one or more of SEQ ID Nos: 1-4. Homologs of the subject HDx
proteins also include versions of the protein which are resistant to post-translational
modification, as for example, due to mutations which alter modification sites (such as
tyrosine, threonine, serine or asparagine residues), or which inactivate an enzymatic
activity associated with the protein.
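
As an illustration of the sequence-homology thresholds above, the following minimal sketch computes percent identity between two pre-aligned amino acid sequences and compares it with a cutoff. It rests on stated assumptions: the function names are hypothetical, the sequences in the usage note are invented placeholders rather than any of SEQ ID Nos: 5-8, and plain identity over ungapped columns is used as a simple stand-in for "homology", which may also be scored with conservative substitutions.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Both inputs must already be aligned to the same length, with '-' as the gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gap columns (an assumption of this sketch)
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 85.0) -> bool:
    """True if the pair reaches the given percent-identity cutoff (85% by default)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For example, two hypothetical ten-residue sequences that differ at one position give 90% identity and therefore pass the 85% cutoff but not a 95% cutoff.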

The HDx polypeptide can comprise a full length protein, such as represented in
SEQ ID No. 5, or it can comprise a fragment corresponding to particular motifs/domains,
or to arbitrary sizes, e.g., at least 5, 10, 25, 50, 100, 150 or 200 amino acids in length. In
preferred embodiments, the polypeptide, or fragment thereof, specifically deacetylates
histone H4. In other preferred embodiments, the HDx polypeptide includes both a v
motif (SEQ ID No. 12) and a x motif (SEQ ID No. 14), preferably a v motif represented
in the general formula SEQ ID No. 13, and a x motif represented in the general formula
SEQ ID No. 15.

In certain preferred embodiments, the invention features a purified or recombinant
HDx polypeptide having a molecular weight in the range of 40 kD to 60 kD. For instance,
preferred HDx polypeptides have molecular weights in the range of 50 kD to about 60 kD,
even more preferably in the range of 53-58 kD. It will be understood that certain
post-translational modifications, e.g., phosphorylation, prenylation and the like, can increase
the apparent molecular weight of the HDx protein relative to the unmodified polypeptide
chain.
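
The molecular weight ranges above can be related to polypeptide length with a rough rule of thumb. The sketch below assumes an average residue mass of about 110 Da, a common approximation that is not taken from this document; the function names and the residue counts in the usage note are hypothetical, and post-translational modifications are ignored.

```python
AVG_RESIDUE_MASS_DA = 110.0  # assumed average mass per amino acid residue, in daltons


def approx_mass_kd(n_residues: int) -> float:
    """Rough molecular weight in kilodaltons for an unmodified polypeptide chain."""
    return n_residues * AVG_RESIDUE_MASS_DA / 1000.0


def in_mass_range(n_residues: int, low_kd: float = 40.0, high_kd: float = 60.0) -> bool:
    """True if the estimated mass falls within the given range (40-60 kD by default)."""
    return low_kd <= approx_mass_kd(n_residues) <= high_kd
```

Under this approximation a hypothetical 500-residue chain comes out near 55 kD, inside both the 40-60 kD and 53-58 kD ranges, while a 300-residue chain (about 33 kD) falls outside them.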

The subject proteins can also be provided as chimeric molecules, such as in the
form of fusion proteins. For instance, the HDx protein can be provided as a recombinant
fusion protein which includes a second polypeptide portion, e.g., a second polypeptide
having an amino acid sequence unrelated (heterologous) to the HDx polypeptide, e.g. the
second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide
portion is an enzymatic activity such as alkaline phosphatase, e.g. the second polypeptide
portion is an epitope tag.



WO 97/35990 PCT/US97/05275



In yet another embodiment, the invention features a nucleic acid encoding an
HDx polypeptide, or polypeptide homologous thereto, which polypeptide has the ability
to modulate, e.g., either mimic or antagonize, at least a portion of the activity of a
wild-type HDx polypeptide. Exemplary HDx-encoding nucleic acid sequences are represented
by SEQ ID Nos: 1-4.

In another embodiment, the nucleic acid of the present invention includes a
coding sequence which hybridizes under stringent conditions with one or more of the
nucleic acid sequences in SEQ ID Nos: 1-4. The coding sequence of the nucleic acid can
comprise a sequence which is identical to a coding sequence represented in one of SEQ
ID Nos: 1-4, or it can merely be homologous to one or more of those sequences. In
preferred embodiments, the nucleic acid encodes a polypeptide which specifically
modulates, by acting as either an agonist or antagonist, the enzymatic activity of an HDx
polypeptide.

Furthermore, in certain preferred embodiments, the subject HDx nucleic acid will
include a transcriptional regulatory sequence, e.g., at least one of a transcriptional
promoter or transcriptional enhancer sequence, which regulatory sequence is operably
linked to the HDx gene sequence. Such regulatory sequences can be used to render
the HDx gene sequence suitable for use as an expression vector. This invention also
contemplates the cells transfected with said expression vector, whether prokaryotic or
eukaryotic, and a method for producing HDx proteins by employing said expression
vectors.

In yet another embodiment, the nucleic acid hybridizes under stringent conditions
to a nucleic acid probe corresponding to at least 12 consecutive nucleotides of either
sense or antisense sequence of one or more of SEQ ID Nos: 1-4; though preferably to at
least 25 consecutive nucleotides; and more preferably to at least 40, 50 or 75 consecutive
nucleotides of either sense or antisense sequence of one or more of SEQ ID Nos: 1-4.
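
The "at least 12 consecutive nucleotides of either sense or antisense sequence" criterion can be sketched computationally as follows. This is a simplification under stated assumptions: the function names are hypothetical, the example sequence in the test is invented rather than taken from SEQ ID Nos: 1-4, and exact string matching stands in for hybridization, which in practice tolerates limited mismatch and depends on temperature and salt conditions.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Antisense strand of a DNA sequence, written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def longest_shared_run(probe: str, target: str) -> int:
    """Length of the longest contiguous stretch of probe that occurs in target."""
    probe, target = probe.upper(), target.upper()
    best = 0
    prev = [0] * (len(target) + 1)
    for p in probe:
        cur = [0] * (len(target) + 1)
        for j, t in enumerate(target, start=1):
            if p == t:
                # extend the run that ended at the previous base of both strings
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def probe_matches(probe: str, target: str, min_run: int = 12) -> bool:
    """True if probe shares min_run consecutive bases with the sense or antisense strand."""
    sense = longest_shared_run(probe, target)
    antisense = longest_shared_run(probe, reverse_complement(target))
    return max(sense, antisense) >= min_run
```

Raising `min_run` to 25, 40, 50 or 75 corresponds to the progressively preferred probe lengths named in the paragraph above.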

Yet another aspect of the present invention concerns an immunogen comprising
an HDx polypeptide in an immunogenic preparation, the immunogen being capable of
eliciting an immune response specific for an HDx polypeptide; e.g. a humoral response,
e.g. an antibody response; e.g. a cellular response. In preferred embodiments, the
immunogen comprises an antigenic determinant, e.g. a unique determinant, from a
protein represented by one of SEQ ID Nos: 5-8.

A still further aspect of the present invention features antibodies and antibody 
preparations specifically reactive with an epitope of the HDx immunogen.

The invention also features transgenic non-human animals, e.g. mice, rats, rabbits,
chickens, frogs or pigs, having a transgene, e.g., animals which include (and preferably
express) a heterologous form of an HDx gene described herein, or which misexpress an
endogenous HDx gene, e.g., an animal in which expression of one or more of the subject
HDx proteins is disrupted. Such a transgenic animal can serve as an animal model for
studying cellular and tissue disorders comprising mutated or mis-expressed HDx alleles
or for use in drug screening.

The invention also provides a probe/primer comprising a substantially purified
oligonucleotide, wherein the oligonucleotide comprises a region of nucleotide sequence
which hybridizes under stringent conditions to at least 12 consecutive nucleotides of
sense or antisense sequence of SEQ ID Nos: 1-4, or naturally occurring mutants thereof.
Nucleic acid probes which are specific for each of the HDx proteins are contemplated by
the present invention, e.g. probes which can discern between nucleic acid encoding a
human or bovine HDx. In preferred embodiments, the probe/primer further includes a
label group attached thereto and able to be detected. The label group can be selected,
e.g., from a group consisting of radioisotopes, fluorescent compounds, enzymes and
enzyme co-factors. Probes of the invention can be used as a part of a diagnostic test kit
for identifying dysfunctions associated with mis-expression of an HDx protein, such as
for detecting, in a sample of cells isolated from a patient, a level of a nucleic acid encoding
a subject HDx protein; e.g. measuring an HDx mRNA level in a cell, or determining
whether a genomic HDx gene has been mutated or deleted. These so-called
"probes/primers" of the invention can also be used as a part of "antisense" therapy, which
refers to administration or in situ generation of oligonucleotide probes or their
derivatives which specifically hybridize (e.g. bind) under cellular conditions with the
cellular mRNA and/or genomic DNA encoding one or more of the subject HDx proteins
so as to inhibit expression of that protein, e.g. by inhibiting transcription and/or
translation. Preferably, the oligonucleotide is at least 12 nucleotides in length, though
primers of 25, 40, 50, or 75 nucleotides in length are also contemplated.

In yet another aspect, the invention provides an assay for screening test
compounds for inhibitors, or alternatively, potentiators, of an interaction between an
HDx protein and an HDx binding protein or nucleic acid sequence. An exemplary
method includes the steps of (i) combining an HDx polypeptide or fragment thereof, one
or more HDx target polypeptides (such as a histone, SIN3, RbAp48 or other protein
which participates in HDx complexes, e.g., one or more proteins having molecular
weights of 250 kDa, 150 kDa, 55 kDa, 50 kDa, 42 kDa, 33-36 kDa and 30 kDa; see also
Example 3), and a test compound, e.g., under conditions wherein, but for the test
compound, the HDx protein and target polypeptide(s) are able to interact; and (ii)
detecting the formation of a complex which includes the HDx protein and target
polypeptide(s) either by directly quantitating the complex, the deacetylase activity of the
HDx protein, or by measuring inductive effects of the HDx protein. A statistically
significant change, such as a decrease, in the formation of the complex in the presence of
a test compound (relative to what is seen in the absence of the test compound) is
indicative of a modulation, e.g., inhibition, of the interaction between the HDx protein
and its target polypeptide.
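
The comparison described in step (ii), complex formation with and without the test compound, can be sketched numerically. The snippet below is a minimal illustration under stated assumptions: the function names are hypothetical, the replicate readings in the test are invented, and Welch's t statistic is one possible choice of test that the source does not specify. Converting the statistic to a p-value would require a t-distribution table or a statistics package.

```python
from math import sqrt
from statistics import mean, variance


def percent_inhibition(control: list[float], treated: list[float]) -> float:
    """Percent decrease in mean signal (e.g. complex formed) relative to the control."""
    control_mean = mean(control)
    return 100.0 * (control_mean - mean(treated)) / control_mean


def welch_t(control: list[float], treated: list[float]) -> float:
    """Welch's t statistic for the difference between control and treated means."""
    standard_error = sqrt(
        variance(control) / len(control) + variance(treated) / len(treated)
    )
    return (mean(control) - mean(treated)) / standard_error
```

A large positive statistic together with a substantial percent inhibition would flag the test compound as a candidate inhibitor of the interaction; a negative percent inhibition would instead suggest a potentiator.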

Furthermore, the present invention contemplates the use of other homologs of the
HDx polypeptides or bioactive fragments thereof to generate similar assay formats. In
one embodiment, the drug screening assay can be derived with a fungal homolog of an
HDx protein, such as RPD3, in order to identify agents which inhibit histone
deacetylation in a yeast cell.

Yet another aspect of the present invention concerns a method for modulating
one or more of growth, differentiation, or survival of a mammalian cell by modulating
HDx bioactivity, e.g., by inhibiting the deacetylase activity of HDx proteins, or disrupting
certain protein-protein interactions. In general, whether carried out in vivo, in vitro, or
in situ, the method comprises treating the cell with an effective amount of an HDx
therapeutic so as to alter, relative to the cell in the absence of treatment, at least one of
(i) rate of growth, (ii) differentiation, or (iii) survival of the cell. Accordingly, the
method can be carried out with HDx therapeutics such as peptides and peptidomimetics or
other molecules identified in the above-referenced drug screens which antagonize the
effects of a naturally-occurring HDx protein on said cell. Other HDx therapeutics include
antisense constructs for inhibiting expression of HDx proteins, and dominant negative
mutants of HDx proteins which competitively inhibit protein-substrate and/or
protein-protein interactions upstream and downstream of the wild-type HDx protein.

In an exemplary embodiment the subject method is used to treat tumor cells by
antagonizing HDx activity and blocking cell cycle progression. In one embodiment, the
subject method includes the treatment of testicular cells, so as to modulate spermatogenesis.
In another embodiment, the subject method is used to modulate osteogenesis, comprising
the treatment of osteogenic cells with an HDx polypeptide. Likewise, where the treated
cell is a chondrogenic cell, the present method is used to modulate chondrogenesis. In
still another embodiment, HDx polypeptides can be used to modulate the differentiation
of progenitor cells, e.g., the method can be used to cause differentiation of
hematopoietic cells, neuronal cells, or other stem/progenitor cell populations, to maintain
such a cell in a differentiated state, and/or to enhance the survival of a differentiated cell,
e.g., to prevent apoptosis or other forms of cell death.

In addition to such HDx therapeutic uses, anti-fungal agents developed with such
screening assays as described herein can be used, for example, as preservatives in
foodstuff, feed supplement for promoting weight gain in livestock, or in disinfectant
formulations for treatment of non-living matter, e.g., for decontaminating hospital
equipment and rooms. In similar fashion, assays provided herein will permit selection of
deacetylase inhibitors which discriminate between the human and insect deacetylase
enzymes. Accordingly, the present invention expressly contemplates the use and
formulations of the deacetylase inhibitors in insecticides, such as for use in management
of insects like the fruit fly. Moreover, certain of the inhibitors can be selected on the
basis of inhibitory specificity for plant HDx-related activities relative to the mammalian
enzymes. Thus, the present invention specifically contemplates formulations of
deacetylase inhibitors for agricultural applications, such as in the form of a defoliant or
the like.

The present method is applicable, for example, to cell culture techniques, such as
in the culturing of hematopoietic cells and other cells whose survival or differentiative
state is dependent on HDx function. Moreover, HDx agonists and antagonists can be
used for therapeutic intervention, such as to enhance survival and maintenance of cells, as
well as to influence organogenic pathways, such as tissue patterning and other
differentiation processes. In an exemplary embodiment, the method is practiced for
modulating, in an animal, cell growth, cell differentiation or cell survival, and comprises
administering a therapeutically effective amount of an HDx polypeptide to alter, relative
to the absence of HDx treatment, at least one of (i) rate of growth, (ii) differentiation, or
(iii) survival of one or more cell-types in the animal.

Another aspect of the present invention provides a method of determining if a
subject, e.g. a human patient, is at risk for a disorder characterized by unwanted cell
proliferation or aberrant control of differentiation. The method includes detecting, in a
tissue of the subject, the presence or absence of a genetic lesion characterized by at least
one of (i) a mutation of a gene encoding an HDx protein, e.g. represented in one of SEQ
ID Nos: 1-4, or a homolog thereof; or (ii) the mis-expression of an HDx gene. In
preferred embodiments, detecting the genetic lesion includes ascertaining the existence of
at least one of a deletion of one or more nucleotides from an HDx gene; an addition of
one or more nucleotides to the gene; a substitution of one or more nucleotides of the
gene; a gross chromosomal rearrangement of the gene; an alteration in the level of a
messenger RNA transcript of the gene; the presence of a non-wild type splicing pattern of
a messenger RNA transcript of the gene; or a non-wild type level of the protein.

For example, detecting the genetic lesion can include (i) providing a probe/primer
including an oligonucleotide containing a region of nucleotide sequence which hybridizes
to a sense or antisense sequence of an HDx gene, e.g. a nucleic acid represented in one of
SEQ ID Nos: 1-4, or naturally occurring mutants thereof, or 5' or 3' flanking sequences
naturally associated with the HDx gene; (ii) exposing the probe/primer to nucleic acid of
the tissue; and (iii) detecting, by hybridization of the probe/primer to the nucleic acid, the
presence or absence of the genetic lesion; e.g. wherein detecting the lesion comprises
utilizing the probe/primer to determine the nucleotide sequence of the HDx gene and,
optionally, of the flanking nucleic acid sequences. For instance, the probe/primer can be
employed in a polymerase chain reaction (PCR) or in a ligation chain reaction (LCR). In
alternate embodiments, the level of an HDx protein is detected in an immunoassay using
an antibody which is specifically immunoreactive with the HDx protein.

In another aspect, the invention provides compounds useful for inhibition of
HDxs. In a preferred embodiment, an HDx inhibitor compound of the invention can be
represented by the formula A-B-C, in which A is a specificity element for selective
binding to an HDx, B is a linker element, and C is an electrophilic moiety capable of
reacting with a nucleophilic moiety of an HDx, with the proviso that the compound is not
butyrate, trapoxin, or trichostatin.

For instance, in one embodiment, there is provided a composition for inhibiting a
histone deacetylase comprising a compound represented by the general formula A-B-C,
wherein

A is selected from the group consisting of cycloalkyls, unsubstituted and
substituted aryls, heterocyclyls, amino acyls, and cyclopeptides;

B is selected from the group consisting of substituted and unsubstituted C4-C8
alkylidenes, C4-C8 alkenylidenes, C4-C8 alkynylidenes, and -(D-E-F)-, in which D and F
are, independently, absent or represent a C2-C7 alkylidene, a C2-C7 alkenylidene or a
C2-C7 alkynylidene, and E represents O, S, or NR', in which R' represents H, a lower alkyl, a
lower alkenyl, a lower alkynyl, an aralkyl, aryl, or a heterocyclyl; and

C is selected from the group consisting of

[chemical structures not reproduced]

and a boronic acid; in
which Z represents O, S, or NR5, and Y; R5 represents a hydrogen, an alkyl, an
alkoxycarbonyl, an aryloxycarbonyl, an alkylsulfonyl, an arylsulfonyl or an aryl; R'6
represents hydrogen, an alkyl, an alkenyl, an alkynyl or an aryl; and R7 represents a
hydrogen, an alkyl, an aryl, an alkoxy, an aryloxy, an amino, a hydroxylamino, an
alkoxylamino or a halogen; with the proviso that the compound is not trapoxin.

In another preferred embodiment, the compound is represented by the general
formula A-B-C, wherein

A is selected from the group consisting of cycloalkyls, unsubstituted and
substituted aryls, heterocyclyls, amino acyls, and cyclopeptides;

B is selected from the group consisting of substituted and unsubstituted C4-C8
alkylidenes, C4-C8 alkenylidenes, C4-C8 alkynylidenes, and -(D-E-F)-, in which D and F
are, independently, absent or represent C2-C7 alkylidenes, C2-C7 alkenylidenes or C2-C7
alkynylidenes, and E represents O, S, or NR', in which R' represents H, a lower alkyl, a
lower alkenyl, a lower alkynyl, an aralkyl, an aryl, or a heterocyclyl; and

C is selected from the group consisting of

[chemical structures not reproduced]

in which R9 represents a hydrogen, an alkyl,
an aryl, a hydroxyl, an alkoxy, an aryloxy or an amino,

with the proviso that the inhibitor compound is not trichostatin.

In still another preferred embodiment, the compound is represented by the general
formula A-B-C, wherein

A is selected from the group consisting of cycloalkyls, unsubstituted and
substituted aryls, heterocyclyls, amino acyls, and cyclopeptides;

B is selected from the group consisting of substituted and unsubstituted C4-C8
alkylidenes, C4-C8 alkenylidenes, C4-C8 alkynylidenes, and -(D-E-F)-, in which D and F
are, independently, absent or a C2-C7 alkylidene, a C2-C7 alkenylidene, or a C2-C7
alkynylidene, and E represents O, S, or NR', in which R' is H, lower alkyl, lower alkenyl,
lower alkynyl, aralkyl, aryl, or heterocyclyl; and

C represents [chemical structure not reproduced], in which Y is O or S, and R7 represents a hydrogen, an
alkyl, an aryl, an alkoxy, an aryloxy, an amino, a hydroxylamino, an alkoxylamino or a
halogen.

The present invention also contemplates pharmaceutical preparations of such
compounds, e.g., in an amount effective for inhibiting proliferation of a cell, formulated
in a pharmaceutically acceptable diluent.

Moreover, such compounds can be used for modulating one or more of growth,
differentiation, or survival of a mammalian cell responsive to HDx-mediated histone
deacetylation, by treating the cell with an effective amount of the deacetylase inhibitor so
as to modulate the deacetylase activity and alter, relative to the cell in the absence of the
agent, at least one of (i) the rate of growth, (ii) the differentiation state, or (iii) the rate of
survival of the cell.

The practice of the present invention will employ, unless otherwise indicated,
conventional techniques of cell biology, cell culture, molecular biology, transgenic
biology, microbiology, recombinant DNA, and immunology, which are within the skill of
the art. Such techniques are explained fully in the literature. See, for example, Molecular
Cloning: A Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold
Spring Harbor Laboratory Press: 1989); DNA Cloning, Volumes I and II (D. N. Glover
ed., 1985); Oligonucleotide Synthesis (M. J. Gait ed., 1984); Mullis et al. U.S. Patent
No: 4,683,195; Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984);
Transcription And Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of
Animal Cells (R. I. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes
(IRL Press, 1986); B. Perbal, A Practical Guide To Molecular Cloning (1984); the
treatise, Methods In Enzymology (Academic Press, Inc., N.Y.); Gene Transfer Vectors
For Mammalian Cells (J. H. Miller and M. P. Calos eds., 1987, Cold Spring Harbor
Laboratory); Methods In Enzymology, Vols. 154 and 155 (Wu et al. eds.),
Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, eds.,
Academic Press, London, 1987); Handbook Of Experimental Immunology, Volumes I-IV
(D. M. Weir and C. C. Blackwell, eds., 1986); Manipulating the Mouse Embryo,
(Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986).

Other features and advantages of the invention will be apparent from the
following detailed description, and from the claims.

Brief Description of the Drawings

Figure 1A illustrates the chemical structures of trapoxin and trichostatin, natural
products that inhibit the enzymatic deacetylation of lysine residues near the NH2-terminus
of histones. The epoxyketone side chain of trapoxin is approximately isosteric with
N-acetyl lysine and likely alkylates an active site nucleophile.

Figure 1B illustrates the copurification of trapoxin binding and histone
deacetylase activities. Nuclear proteins from bovine thymus were precipitated with
ammonium sulfate and fractionated on a Mono Q column. Trapoxin binding was
assayed by charcoal precipitation with [3H]trapoxin. For the histone deacetylase assay,
a peptide corresponding to bovine histone H4 (1-24) was synthesized. The peptide was
chemically acetylated with sodium [3H]acetate (5.3 Ci/mmol, New England
Nuclear)/BOP reagent (Aldrich) and purified by reverse phase HPLC. Two microliters of
[3H]peptide (~40,000 dpm) were used per 200 µl assay. After incubation at 37°C for one
hour, the reaction was quenched with 1 M HCl/0.16 M acetic acid (50 µl). Released
[3H]acetic acid was extracted with 600 µl of ethyl acetate and quantified by scintillation
counting. Pretreatment of crude or partially purified enzyme with trapoxin or trichostatin
(20 nM) for 30 min. at 4°C abolished deacetylase activity. A280, absorbance at 280 nm.
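
The scintillation counts from the deacetylase assay above can be turned into a simple activity readout. The following sketch assumes that released counts are expressed as a percentage of the roughly 40,000 dpm of labeled peptide added per assay; the function names, the background correction, and the numbers in the usage note are hypothetical, and the extraction and counting efficiencies are ignored.

```python
INPUT_DPM = 40_000.0  # approximate labeled peptide added per assay, per the text above


def percent_released(released_dpm: float, input_dpm: float = INPUT_DPM,
                     background_dpm: float = 0.0) -> float:
    """Background-corrected released [3H]acetate as a percentage of input counts."""
    return 100.0 * (released_dpm - background_dpm) / input_dpm


def residual_activity(inhibited_dpm: float, uninhibited_dpm: float,
                      background_dpm: float = 0.0) -> float:
    """Activity remaining after inhibitor pretreatment, as a percentage of the untreated sample."""
    return 100.0 * (inhibited_dpm - background_dpm) / (uninhibited_dpm - background_dpm)
```

For instance, with a hypothetical background of 2,000 dpm, a sample releasing 10,000 dpm corresponds to 20% of the input counts, and an inhibitor-treated sample reading at background corresponds to 0% residual activity.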

Figure 2A shows the synthesis of K-trap and the K-trap affinity matrix. K-trap
contains a protected lysine residue in place of the phenylalanine at position two in
trapoxin. Alloc = allyloxycarbonyl.

Figure 2B is a silver stained gel showing bovine and human trapoxin binding
proteins. Proteins bound to the K-trap affinity matrix in the presence or absence of
trapoxin or trichostatin were eluted by boiling in SDS loading buffer and analyzed by
SDS-PAGE (9% gel). Nuclear proteins from human Jurkat T cells were prepared
identically to those from bovine thymus (Figure 1B). Molecular size standards (in
kilodaltons) are indicated to the right.

Figure 3A is the predicted amino acid sequence of human HD1. An in-frame stop
codon was found upstream of the starting methionine. Regions equivalent to
microsequenced tryptic peptides from the purified bovine protein are boxed. Underlined
amino acids 319-334 and 467-482 denote the sequences of synthetic peptides that were
conjugated to KLH and used to generate polyclonal antisera. Abbreviations for the
amino acid residues are: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ile; K,
Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and
Y, Tyr.

Figure 3B is a protein immunoblot analogous to the silver stained gel in Figure
2B, showing the relationship between bovine p46-p49 and human p55 (top panels) and
confirming the identity of p50 (bovine and human) as RbAp48 (bottom panels). Proteins
eluted from the K-trap affinity matrix (Figure 2) were separated by SDS-PAGE and
transferred to Immobilon-P (Millipore). Blots were probed with polyclonal anti-HD1
(319-336) or monoclonal anti-RbAp48 and bound antibodies were detected with
enhanced chemiluminescence (Amersham).

Figure 4A is an immunoprecipitation of endogenous histone deacetylase activity
with affinity purified anti-HD1 (467-482) antibodies. Anti-HD1 (467-482)
immunoprecipitates from equivalent amounts of Jurkat nuclear extract (1 mg nuclear
protein supplemented with 0.5 M NaCl, 1% BSA, and 0.1% NP-40) were isolated in the
presence or absence of HD1 (467-482) peptide competitor. After resuspending the
immunoprecipitates in HDx buffer [20 mM tris (pH 8), 150 mM NaCl, 10% glycerol],
inhibitors were added as indicated, and histone deacetylase activity was measured as
described in Figure 1A.

Figure 4B shows the coprecipitation of HD1 and RbAp48, as detected by protein
immunoblot analysis.

Figure 4C demonstrates the histone deacetylase activity of recombinant HD1-F.
Tag Jurkat cells (Clipstone et al. (1992) Nature 357, 695-7) were transfected with pBJ5
(vector alone) or pBJ5/HD1-F (encoding COOH-terminal FLAG epitope tagged HD1)
by electroporation and detergent lysates were prepared [0.5% Triton X-100, 50 mM tris
(pH 8), 100 mM NaCl, 10% glycerol]. Anti-FLAG antibodies conjugated to agarose
beads (IBI) were used to immunoprecipitate recombinant HD1 in the presence or absence
of FLAG peptide competitor, and histone deacetylase activity was measured as described
above.

Figure 4D shows the interaction between recombinant HD1-F and the K-trap
affinity matrix. Lysates from Jurkat cells transfected with pBJ5/HD1-F were incubated
with the K-trap affinity matrix in the presence or absence of inhibitors. Immunoblots of
the eluted proteins were probed with the anti-FLAG M2 monoclonal antibody (IBI).

Figures 5A and 5B are sequence alignments for various HDx and HDx-related
cDNAs and proteins, respectively.

Figure 6 depicts exemplary specificity elements (A), linker elements (B), and
electrophilic moieties (C) for generating compounds which are capable of reacting with a
nucleophilic moiety of an HDx protein.

Figure 7 illustrates an exemplary synthesis of trichostatin analogs. 

Figures 8A-8C illustrate a synthesis of tritiated Trapoxin B. 

Figures 9A-9C depict a synthesis of the K-trap and K-trap affinity matrix. 

Figures 10A-10B: mSin3A is present in cells as a large stable multiprotein 
complex. Nuclear lysates were prepared from U937 cells metabolically labeled with 
[35S]-methionine and low stringency immunoprecipitations performed with antiserum 
specific for mSin3A. "+ block" shows proteins immunoprecipitated when the anti-mSin3A 
was preincubated with purified GST-PAH2 (A). In (B), low stringency mSin3A 
immunoprecipitates were washed for an additional 60 minutes using the salt and 
detergent conditions indicated at the top of the Figure. In (A) and (B), the 
immunoprecipitates were analyzed by SDS-PAGE and autoradiography. Apparent 
molecular weights of the coprecipitating proteins and the sizes of the molecular weight 
markers are given in kilodaltons. 

Figures 11A-D: mSin3A and HDAC1 associate in vivo. Immunoprecipitations 
were performed using nuclear extracts from [35S]-methionine labeled U937 cells. (A) The 
left lane shows proteins from an anti-mSin3A immunoprecipitate. The right lane shows 
proteins eluted from an anti-mSin3A immunoprecipitate and reprecipitated with antiserum 
specific for HD1. In (B) and (C), low stringency immunoprecipitations were 
performed using antiserum specific for the carboxy-terminus of HD1. "+ block" indicates 
that the HD1 antiserum was preincubated with the immunizing peptide. In (C), proteins 
immunoprecipitated with anti-mSin3A are shown for reference; proteins eluted from a low 
stringency anti-HD1 immunoprecipitate and reprecipitated with anti-mSin3A are shown 
in the rightmost lane. In (A), (B), and (C), autoradiographs of SDS-PAGE gels are 
shown. Apparent molecular weights of the coprecipitating proteins and the sizes of the 
molecular weight markers are given in kilodaltons. In (D), in vitro histone deacetylase 
activity in anti-mSin3A immunoprecipitates is shown. Human Jurkat cell extracts (12 
mg) were immunoprecipitated using anti-mSin3A polyclonal antibodies. "+ block" 
indicates that the anti-mSin3A antibody was preincubated with GST-PAH2; "+10 nM 
trapoxin" indicates that the immunoprecipitated proteins were pretreated with 10 nM 
trapoxin for 30 minutes at 4°C prior to being assayed for histone deacetylase activity. 

Figures 12A-C: RbAp48 is associated with mSin3A in vivo, and recombinant 
HD1, RbAp48 and mSin3A copurify from insect cell extracts. (A) TAg Jurkat cell 
lysates were immunoprecipitated using antibodies specific to the C-terminal portion of HD1 
(left) or antibodies specific to PAH2 of mSin3A (right). Parallel immunoprecipitations 
were blocked as described in Figure 11. Immunopurified proteins were analyzed by 
SDS-PAGE and immunoblotted with α-RbAp48 monoclonal antibody 12B1. (B & C) Equal 
amounts of baculovirus coinfected Sf9 cell extracts were affinity purified using 
Ni2+-agarose "Ni" or α-FLAG-M2-agarose "F". Purified recombinant proteins were 
analyzed by SDS-PAGE, transferred to Immobilon-P (Millipore) and immunoblotted with 
α-FLAG to detect HD1-F (B) or α-Flu (12CA5) to detect p48-HA (C). We observe a 
reduction in expression of HD1-F and p48-HA when coexpressed with mSin3A. 

Figures 13A-C: Trapoxin reverses transcriptional repression by mSin3A. (A) 
The structure of the minimal reporter gene derived from the myelomonocytic growth 
factor gene and the expression vectors. Mad(Pro)N35GALVP16 has leucine at position 
12 and alanine at position 16 mutated to proline as indicated. These point mutations 
prevent association between mSin3A and Mad (Ayer et al., 1995). The transcriptional 
activity of MadN35GALVP16 and Mad(Pro)N35GALVP16 was determined by 
measuring luciferase activity (Relative Light Units, RLU) of transfected 293 cells 
following an 8 hour treatment with 0 (solid bars) or 10 nM trapoxin (striped bars) (B). 
To control for differences in transfection efficiency, the RLU values were normalized to 
the β-galactosidase activity produced by a cotransfected CMV-βGAL construct. Shown 
are data from a representative experiment, and the error is reported as the standard error of 
the mean (s.e.m.). This experiment has been done a minimum of five times in triplicate 
with similar results. An 8 hour treatment of 293 cells with 10 nM trapoxin is within the 
linear range of the response of the reporter gene. Furthermore, trapoxin treatment did 
not prevent association between mSin3A and HD1 (data not shown). (C) Trapoxin 
inhibits histone deacetylase activity of human 293 cells in vivo. 2 x 10* cells were 
cultured for 8 hours in the absence, "0", or in the presence of 10 nM trapoxin. Cells 
were harvested and crude extracts from approximately 1 x lO'^ cells (solid bars) or 
anti-HD1 immunoprecipitations of extracts from approximately 4 x lO^^ cells (white bars) 
were assayed for histone deacetylase activity in vitro. 
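
The normalization described for Figure 13B can be illustrated with a minimal sketch: each luciferase reading is divided by the β-galactosidase activity measured for the same transfection, and the mean and standard error of the mean are taken over the replicates. The function name and every numeric reading below are hypothetical placeholders chosen for illustration; they are not data from the experiments.

```python
import math
from statistics import mean, stdev


def normalized_rlu(rlu, beta_gal):
    """Return (mean, s.e.m.) of RLU readings normalized to beta-galactosidase activity.

    rlu and beta_gal are paired replicate readings from the same transfections.
    """
    if len(rlu) != len(beta_gal) or len(rlu) < 2:
        raise ValueError("need at least two paired replicate readings")
    ratios = [r / b for r, b in zip(rlu, beta_gal)]
    # Standard error of the mean: sample standard deviation over sqrt(n).
    sem = stdev(ratios) / math.sqrt(len(ratios))
    return mean(ratios), sem


# Hypothetical triplicate readings for a single condition.
avg, sem = normalized_rlu([1200.0, 1500.0, 1800.0], [2.0, 2.0, 2.0])
print(f"{avg:.1f} +/- {sem:.1f} (normalized RLU, mean +/- s.e.m.)")
```

Dividing before averaging keeps each replicate's transfection-efficiency correction tied to its own well, which is one reasonable reading of the procedure; averaging first and dividing afterwards would be a different calculation.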
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Detailed Description of the Invention 

The positioning of nucleosomes relative to particular regulatory elements in 
genomic DNA has emerged as a mechanism for managing the association of sequence-specific 
DNA-binding proteins with promoters, enhancers and other transcriptional 
regulatory sequences. Two modifications to nucleosomes have been observed to 
influence the association of DNA-binding proteins with chromatin. Depletion of histones 
H2A/H2B from the nucleosome facilitates the binding of RNA polymerase II (Baer et al. 
(1983) Nature 301:482-488) and TFIIIA (Hayes et al. (1992) PNAS 89:1229-1233). 
Likewise, acetylation of the core histones apparently destabilizes the nucleosome and is 
thought to modulate the accessibility of transcription factors to their respective enhancer 
and promoter elements (Oliva et al. (1990) Nucleic Acids Res. 18:2739-2747; and Walker et 
al. (1990) J. Biol. Chem. 265:5622-5746). In both cases, overall histone-DNA contacts 
are altered. 

In one aspect, the present invention concerns the discovery of a family of genes in 
mammals, referred to herein as "histone deacetylases" or 
"HDx's". Experimental evidence indicates a functional role for the HDx gene products as 
catalysts of the deacetylation of histones in mammalian cells, and accordingly a role 
in determining tissue fate and maintenance. For instance, the results provided below 
indicate that proteins encoded by the HDx genes may participate, under various 
circumstances, in the control of proliferation, differentiation and cell death. 

The family of HDx genes apparently encodes at least three different sub-families, 
e.g., paralogs, and members have been identified from the cells of various mammals. The HDx 
proteins were first isolated from bovine thymus nuclei by use of a binding assay which 
exploited the ability of trapoxin, an inhibitor of histone deacetylase activity, to isolate 
proteins which co-purified with a histone deacetylase activity. The partial identity of the 
isolated proteins was determined by peptide microsequencing, and primers based on the 
peptide sequences were used to clone human cDNAs from a T cell library. One of the 
resulting clones, HD1, is represented by 
SEQ ID No. 1 (nucleotide) and SEQ ID No. 5 (amino acid). 

A search of expressed sequence tag (EST) libraries turned up partial sequences 
for human HDx transcripts, and revealed the existence of at least two other human HDx 
genes related to HD1, these other paralogs being referred to herein as HD2 and HD3. 
Nucleotide and amino acid sequences for partial clones of other human HDx homologs 
are provided by SEQ ID Nos. 2-4 and 6-8, respectively. 

Analysis of the HDx sequences indicated no obvious similarities with any 
previously identified domains or motifs. However, the fact that each full-length clone 
lacks a signal sequence, along with the observation that the proteins can be detected in the 
nucleus, indicates that the HDx genes encode intracellular proteins. 

Careful inspection of the HDx clones suggests at least two novel motifs, one or 
both of which may be characteristic of at least subfamilies of the mammalian HDx family. 
The first apparently conserved structural element of the HDx family occurs in the 
N-terminal portion of the molecule, and is designated herein as the "ν motif". With 
reference to human HD1, the ν motif corresponds to amino acid residues Asp130-Phe198. 
By alignment of the human HDx sequences, the element is represented by the 
consensus sequence: 

DXXXNXXGGLHHAKKXEASGFCYX^©IVXXIXEIXXYHX^ 
VMTXSF, (SEQ ID No. 12) 
more preferably by the consensus sequence: 
15 DlAXiNWAGC5LHHAKKX2EASGFCYVlTOIVX3X4lI^lXYHX5RVLYlDIDI^^ 
7TD-RVMrVSF (SEQ ID No. 13) 

wherein each Xn represents any single amino acid, though more preferably represents 
an amino acid residue in the corresponding human HDx sequences of the appended 
sequence listing. 

A second motif, herein designated the "χ motif", is represented by the consensus 
sequence: 

CVXXXKXFXXPXXXXGGGGYTXRNVARXWXXET (SEQ ID No. 14) 
more preferably by the consensus sequence: 

CVEX8VKX9FNX10PLLX11LGGGGYTX12RNVARCWTYET (SEQ ID No. 15) 
wherein each Xn represents any single amino acid, though more preferably represents 
an amino acid residue in the corresponding human HDx sequences of the appended 
sequence listing. The χ motif can be found in the human HD1 sequence at Cys284-Thr316. 
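
A consensus of this kind translates directly into a pattern search. The sketch below, offered only as an illustration, converts the second-motif consensus (SEQ ID No. 14) into a regular expression in which each X matches any one of the twenty standard amino acids, then reports where the motif occurs in a query protein. The helper names and the test peptide are hypothetical; the peptide is simply the consensus with every X replaced by alanine and a few arbitrary flanking residues added.

```python
import re

# Consensus for the second motif (SEQ ID No. 14); X stands for any single amino acid.
CHI_CONSENSUS = "CVXXXKXFXXPXXXXGGGGYTXRNVARXWXXET"

# One-letter codes for the twenty standard amino acids.
ANY_RESIDUE = "[ACDEFGHIKLMNPQRSTVWY]"


def consensus_to_regex(consensus):
    """Compile a consensus string into a regex, mapping each X to a wildcard residue."""
    return re.compile("".join(ANY_RESIDUE if c == "X" else c for c in consensus))


def find_motif(protein, consensus=CHI_CONSENSUS):
    """Return the 1-based (start, end) residue numbers of the first match, or None."""
    match = consensus_to_regex(consensus).search(protein.upper())
    return (match.start() + 1, match.end()) if match else None


# Hypothetical query: consensus with X -> A, flanked by arbitrary residues.
peptide = "MKL" + CHI_CONSENSUS.replace("X", "A") + "GSS"
print(find_motif(peptide))
```

The consensus is 33 residues long, which agrees with the span given for human HD1 (residues 284 through 316 inclusive).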
The family of HDx proteins apparently ranges in size from about 40kD to about 
60kD for the unmodified polypeptide chain. For instance, the bovine HD1 protein 
migrates on an SDS-PAGE (9%) gel with an apparent molecular weight of 46kD. The 
human HD1 amino acid sequence predicts a molecular weight for the polypeptide chain 
of 55kD. 
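
A predicted molecular weight of this kind is obtained by summing residue masses over the amino acid sequence, and it need not coincide with the apparent weight seen on a gel. The following is a minimal sketch of that arithmetic, assuming standard average residue masses; the function name is hypothetical and the example input is glycylglycine rather than any sequence from the appended listing.

```python
# Average residue masses in daltons (amino acid minus one water), standard values.
RESIDUE_MASS = {
    "G": 57.05, "A": 71.08, "S": 87.08, "P": 97.12, "V": 99.13,
    "T": 101.10, "C": 103.14, "L": 113.16, "I": 113.16, "N": 114.10,
    "D": 115.09, "Q": 128.13, "K": 128.17, "E": 129.12, "M": 131.19,
    "H": 137.14, "F": 147.18, "R": 156.19, "Y": 163.18, "W": 186.21,
}
WATER = 18.02  # one water is added back for the free N- and C-termini


def predicted_mass_kd(sequence):
    """Predicted mass in kD of an unmodified polypeptide chain."""
    seq = sequence.upper()
    unknown = set(seq) - set(RESIDUE_MASS)
    if not seq or unknown:
        raise ValueError(f"empty sequence or unknown residues: {sorted(unknown)}")
    return (sum(RESIDUE_MASS[aa] for aa in seq) + WATER) / 1000.0


# Glycylglycine, used here only as a small check of the arithmetic.
print(round(predicted_mass_kd("GG"), 5))
```

Post-translational modification and anomalous gel migration are not modeled, so the result is comparable only to the "unmodified polypeptide chain" figure.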
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Accordingly, certain aspects of the present invention relate to nucleic acids 
encoding HDx proteins, the HDx proteins themselves, antibodies immunoreactive with 
HDx proteins, and preparations of such compositions. Moreover, the present invention 
provides diagnostic and therapeutic assays and reagents for detecting and treating 
disorders involving, for example, aberrant expression (or loss thereof) of HDx homologs. 
In addition, drug discovery assays are provided for identifying agents which can modulate 
the biological function of HDx proteins, such as by altering the binding of HDx molecules 
to either proteins or nucleic acids. Such agents can be useful therapeutically to alter the 
growth and/or differentiation of a cell. Other aspects of the invention are described 
below or will be apparent to those skilled in the art in light of the present disclosure. 

Analysis of the human HDx sequences, while not revealing any obvious 
similarities to known domains or motifs, did indicate similarities with previously identified 
proteins from both Saccharomyces cerevisiae and Xenopus laevis. Those genes, RPD3 
(SEQ ID No. 9) and Xe-RPD3 (SEQ ID No. 10), respectively, had not previously been 
ascribed any specific function. However, based on our observations for the function of 
HD1, it is now apparent that each of these other proteins is also a deacetylase and 
represents a potential therapeutic target. Accordingly, drug discovery assays are provided 
for identifying agents which can modulate the biological function of "HDx-related" 
proteins, such as RPD3 homologs, by altering the enzymatic activity of the deacetylase, 
or its binding to other cellular components including homologs of RbAp48 (described 
infra). Such agents can be useful therapeutically to alter the growth and/or differentiation 
of non-human cells, such as in the treatment of mycotic infections, or as additives to 
livestock feed, e.g., to promote weight gain, or as topical antiseptics for sterilizing 
medical equipment. 

In addition, we isolated another bovine protein having an approximate molecular 
size of 50kD which apparently binds HDx proteins isolated by the trapoxin matrix, and 
microsequencing of that protein demonstrated that it was related to the protein referred 
to in the art as RbAp48 (Qian et al. (1993) Nature 364:648; SEQ ID No. 11). RbAp48 
was originally identified as a protein that binds to the retinoblastoma (Rb) gene product. 
The retinoblastoma (RB) gene product plays a role in tumor suppression (Weinberg, 
R.A. (Sept. 1988) Scientific Amer. pp 44-51; Hansen et al. (1988) Trends Genet. 4:125-128). 
The role of RB as a tumor-suppressor protein in cell-cycle control is believed to be 
similar to that of another tumor-suppressor, p53 (Green (1989) Cell 56:1-3; Mowat et al. 
(1985) Nature 314:633-636). Inactivation or mutation of the second RB allele in one of 
the somatic cells of these susceptible individuals appears to be the molecular event that 
leads to tumor formation (Cavenee et al. (1983) Nature 305:779-784; Friend et al. 
(1987) PNAS 84:9059-9063). 

The growth suppression function of the retinoblastoma protein is thought to be 
mediated by Rb binding to cellular proteins. RbAp48 is one of the major proteins that 
binds to a putative functional domain at the carboxy terminus of the Rb protein. 
Complex formation between RbAp48 and Rb occurs in vitro and in vivo, and apparently 
involves direct interaction between the proteins. Like Rb, RbAp48 is a ubiquitously 
expressed nuclear protein. RbAp48 shares sequence homology with MSI1, a negative 
regulator of the Ras-cyclic AMP pathway in the yeast Saccharomyces cerevisiae. 
Furthermore, like MSI1, human RbAp48 suppresses the heat-shock sensitivity of the 
yeast ira1 strains and RAS2Val19 strains. Interaction with RbAp48 may be one of the 
mechanisms for suppression of growth mediated by Rb. Accordingly, the interaction of 
RbAp48 with HDx proteins further implicates the HDx proteins in cell-cycle regulation. 

The RbAp48 interaction with HDx and HDx-related proteins represents yet 
another therapeutic target. Accordingly, drug discovery assays are provided for 
identifying agents which can modulate the interaction of RbAp48 proteins and the like 
with HDx-related proteins. Such assays can be derived to detect the ability of a test 
agent to alter protein-protein contacts, or to alter the enzymatic activity of the 
deacetylase in complexes including an RbAp48 protein (e.g., where such complexes 
allosterically modulate the HDx enzymatic activity). As above, such agents can be useful 
therapeutically to alter the growth and/or differentiation of cells. 

Members of the Mad family of bHLHZip proteins heterodimerize with Max to 
repress transcription in a sequence-specific manner. Transcriptional repression by 
Mad:Max heterodimers is mediated by ternary complex formation with either of the 
corepressors mSin3A or mSin3B. Example 3 demonstrates that Sin3 proteins are an in 
vivo component of large, heterogeneous multiprotein complexes and are tightly and 
specifically associated with at least seven polypeptides. Two of the Sin3-associated 
proteins, p50 and p55, are members of the histone deacetylase family described herein. 
Sin3 immune complexes possess histone deacetylase activity that is sensitive to the 
specific deacetylase inhibitor trapoxin. Sin3-targeted repression of a reporter gene is 
reduced by trapoxin treatment, suggesting that histone deacetylation mediates 
transcriptional repression through Mad-Max-Sin3A multimeric complexes. 

The Sin3 interaction with HDx and HDx-related proteins represents still another 
therapeutic target. Thus, in one aspect of the present invention there are provided drug 
discovery assays for identifying agents which can modulate the interaction of Sin3 
proteins and the like with HDx-related proteins. Such assays can be derived to detect the 
ability of a test agent to alter protein-protein contacts, or to alter the enzymatic activity 
of the deacetylase in complexes including Sin3 or other transcriptional regulatory 
proteins. As above, such agents can be useful therapeutically to alter the growth and/or 
differentiation of cells. 

For convenience, certain terms employed in the specification, examples, and 
appended claims are collected here. 

As used herein, the term "nucleic acid" refers to polynucleotides such as 
deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The 
term should also be understood to include, as equivalents, analogs of either RNA or 
DNA made from nucleotide analogs, and, as applicable to the embodiment being 
described, single (sense or antisense) and double-stranded polynucleotides. 

As used herein, the term "gene" or "recombinant gene" refers to a nucleic acid 
comprising an open reading frame encoding one of the HDx polypeptides of the present 
invention, including both exon and (optionally) intron sequences. A "recombinant gene" 
refers to nucleic acid encoding an HDx polypeptide and comprising HDx-encoding exon 
sequences, though it may optionally include intron sequences which are either derived 
from a chromosomal HDx gene or from an unrelated chromosomal gene. Exemplary 
recombinant genes encoding the subject HDx polypeptide are represented in the 
appended Sequence Listing. The term "intron" refers to a DNA sequence present in a 
given HDx gene which is not translated into protein and is generally found between 

As used herein, the term "transfection" means the introduction of a nucleic acid, 
e.g., an expression vector, into a recipient cell by nucleic acid-mediated gene transfer. 
"Transformation", as used herein, refers to a process in which a cell's genotype is 
changed as a result of the cellular uptake of exogenous DNA or RNA, and, for example, 
the transformed cell expresses a recombinant form of an HDx polypeptide or, where 
antisense expression occurs from the transferred gene, the expression of a 
naturally-occurring form of the HDx protein is disrupted. 

As used herein, the term "specifically hybridizes" refers to the ability of the 
probe/primer of the invention to hybridize to at least 15 consecutive nucleotides of an 
HDx gene, such as an HDx sequence designated in one of SEQ ID Nos: 1-4, or a 
sequence complementary thereto, or naturally occurring mutants thereof, such that it has 
less than 15%, preferably less than 10%, and more preferably less than 5% background 
hybridization to a cellular nucleic acid (e.g., mRNA or genomic DNA) encoding a protein 
other than an HDx protein, as defined herein. In preferred embodiments, the 
oligonucleotide probe specifically detects only one of the subject HDx paralogs, e.g., 
does not substantially hybridize to transcripts for other HDx homologs in the same 
species. 
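
The background thresholds in this definition can be expressed as a simple classification. The sketch below is illustrative only and rests on an assumption the text does not spell out: that "background hybridization" is measured as the signal obtained on a non-HDx nucleic acid divided by the signal obtained on the HDx target. The function name, tier labels and example signal values are hypothetical.

```python
def specificity_tier(target_signal, background_signal):
    """Classify a probe by its background hybridization relative to the target signal.

    Assumes background is expressed as background_signal / target_signal.
    """
    if target_signal <= 0 or background_signal < 0:
        raise ValueError("target signal must be positive and background non-negative")
    background = background_signal / target_signal
    if background < 0.05:
        return "more preferred (< 5%)"
    if background < 0.10:
        return "preferred (< 10%)"
    if background < 0.15:
        return "specific (< 15%)"
    return "not specific (>= 15%)"


# Hypothetical signal intensities in arbitrary units.
for background in (4.0, 8.0, 12.0, 20.0):
    print(background, specificity_tier(100.0, background))
```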

As used herein, the term "vector" refers to a nucleic acid molecule capable of 
transporting another nucleic acid to which it has been linked. One type of preferred 
vector is an episome, i.e., a nucleic acid capable of extra-chromosomal replication. 
Preferred vectors are those capable of autonomous replication and/or expression of nucleic 
acids to which they are linked. Vectors capable of directing the expression of genes to 
which they are operatively linked are referred to herein as "expression vectors". In 
general, expression vectors of utility in recombinant DNA techniques are often in the 
form of "plasmids", which refer generally to circular double-stranded DNA loops which, 
in their vector form, are not bound to the chromosome. In the present specification, 
"plasmid" and "vector" are used interchangeably as the plasmid is the most commonly 
used form of vector. However, the invention is intended to include such other forms of 
expression vectors which serve equivalent functions and which become known in the art 
subsequently hereto. 

"Transcriptional regulatory sequence" is a generic term used throughout the 
specification to refer to DNA sequences, such as initiation signals, enhancers, and 
promoters, which induce or control transcription of protein coding sequences with which 
they are operably linked. In preferred embodiments, transcription of one of the 
recombinant HDx genes is under the control of a promoter sequence (or other 
transcriptional regulatory sequence) which controls the expression of the recombinant 
gene in a cell-type in which expression is intended. It will also be understood that the 
recombinant gene can be under the control of transcriptional regulatory sequences which 
are the same or which are different from those sequences which control transcription of 
the naturally-occurring forms of HDx genes. 

As used herein, the term "tissue-specific promoter" means a DNA sequence that 
serves as a promoter, i.e., regulates expression of a selected DNA sequence operably 
linked to the promoter, and which effects expression of the selected DNA sequence in 
specific cells of a tissue, such as cells of hepatic, pancreatic, neuronal or hematopoietic 
origin. The term also covers so-called "leaky" promoters, which regulate expression of a 
selected DNA primarily in one tissue, but can cause at least low level expression in other 
tissues as well. 

As used herein, a "transgenic animal" is any animal, preferably a non-human 
mammal, bird or an amphibian, in which one or more of the cells of the animal contain 
heterologous nucleic acid introduced by way of human intervention, such as by transgenic 
techniques well known in the art. The nucleic acid is introduced into the cell, directly or 
indirectly by introduction into a precursor of the cell, by way of deliberate genetic 
manipulation, such as by microinjection or by infection with a recombinant virus. The 
term genetic manipulation does not include classical cross-breeding, or in vitro 
fertilization, but rather is directed to the introduction of a recombinant DNA molecule. 
This molecule may be integrated within a chromosome, or it may be extrachromosomally 
replicating DNA. In the typical transgenic animals described herein, the transgene causes 
cells to express a recombinant form of one of the HDx proteins, e.g., either agonistic or 
antagonistic forms. However, transgenic animals in which the recombinant HDx gene is 
silent are also contemplated, as for example, the FLP or CRE recombinase dependent 
constructs described below. Moreover, "transgenic animal" also includes those 
recombinant animals in which gene disruption of one or more HDx genes is caused by 
human intervention, including both recombination and antisense techniques. 

The "non-human animals" of the invention include vertebrates such as rodents, 
non-human primates, sheep, dogs, cows, chickens, amphibians, reptiles, etc. Preferred 
non-human animals are selected from the rodent family including rat and mouse, most 
preferably mouse, though transgenic amphibians, such as members of the Xenopus genus, 
and transgenic chickens can also provide important tools for understanding and 
identifying agents which can affect, for example, embryogenesis and tissue formation. 
The invention also contemplates transgenic insects, including those of the genus 
Drosophila, such as D. melanogaster. The term "chimeric animal" is used herein to refer 
to animals in which the recombinant gene is found, or in which the recombinant gene is 
expressed, in some but not all cells of the animal. The term "tissue-specific chimeric 
animal" indicates that one of the recombinant HDx genes is present and/or expressed or 
disrupted in some tissues but not others. 

As used herein, the term "transgene" means a nucleic acid sequence (encoding, 
e.g., one of the HDx polypeptides, or pending an antisense transcript thereto), which is 
partly or entirely heterologous, i.e., foreign, to the transgenic animal or cell into which it 
is introduced, or is homologous to an endogenous gene of the transgenic animal or cell 
into which it is introduced, but which is designed to be inserted, or is inserted, into the 
animal's genome in such a way as to alter the genome of the cell into which it is inserted 
(e.g., it is inserted at a location which differs from that of the natural gene or its insertion 
results in a knockout). A transgene can include one or more transcriptional regulatory 
sequences and any other nucleic acid, such as introns, that may be necessary for optimal 
expression of a selected nucleic acid. 

As is well known, genes for a particular polypeptide may exist in single or 
multiple copies within the genome of an individual. Such duplicate genes may be identical 
or may have certain modifications, including nucleotide substitutions, additions or 
deletions, which all still code for polypeptides having substantially the same activity. The 
term "DNA sequence encoding an HDx polypeptide" may thus refer to one or more 
genes within a particular individual. Moreover, certain differences in nucleotide 
sequences may exist between individuals of the same species, which are called alleles. 
Such allelic differences may or may not result in differences in amino acid sequence of the 
encoded polypeptide yet still encode a protein with the same biological activity. 

"Homology" refers to sequence similarity between two peptides or between two 
nucleic acid molecules. Homology can be determined by comparing a position in each 
sequence which may be aligned for purposes of comparison. When a position in the 
compared sequence is occupied by the same base or amino acid, then the molecules are 
homologous at that position. A degree of homology between sequences is a function of 
the number of matching or homologous positions shared by the sequences. An 
"unrelated" or "non-homologous" sequence shares less than 40 percent identity, though 
preferably less than 25 percent identity, with one of the HDx sequences of the present 
invention. 
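
The position-by-position comparison described above can be sketched as a percent-identity calculation over two pre-aligned sequences. This is an illustration under stated assumptions: the sequences are already aligned to equal length, "-" marks a gap, and gapped columns are left out of the count (the text does not say how gaps are treated). The function names and the example strings are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percent of compared aligned positions holding the same base or amino acid."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    # Skip columns where either sequence has a gap (an assumption of this sketch).
    columns = [(a, b) for a, b in zip(seq_a.upper(), seq_b.upper())
               if a != "-" and b != "-"]
    if not columns:
        return 0.0
    matches = sum(a == b for a, b in columns)
    return 100.0 * matches / len(columns)


def is_unrelated(seq_a, seq_b, cutoff=40.0):
    """Apply the 'unrelated' threshold (40 percent by default, 25 percent if preferred)."""
    return percent_identity(seq_a, seq_b) < cutoff


# Hypothetical aligned fragments differing at two of ten positions.
print(percent_identity("GGGGYTIRNV", "GGGGYTARNA"))
print(is_unrelated("GGGGYTIRNV", "GGGGYTARNA"))
```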

As used herein, an "HDx-related" protein refers to the HDx proteins described 
herein, and other human homologs of those HDx sequences, as well as orthologs and 
paralogs (homologs) of the HDx proteins in other species, ranging from yeast to other 
mammals, e.g., homologous histone deacetylases. The term "ortholog" refers to genes or 
proteins which are homologs via speciation, e.g., closely related and assumed to have 
common descent based on structural and functional considerations. Orthologous proteins 
function as recognizably the same activity in different species. The term "paralog" refers 
to genes or proteins which are homologs via gene duplication, e.g., duplicated variants of 
a gene within a genome. See also Fitch, WM (1970) Syst Zool 19:99-113. 

"Cells," "host cells" or "recombinant host cells" are terms used interchangeably 
herein. It is understood that such terms refer not only to the particular subject cell but to 
the progeny or potential progeny of such a cell. Because certain modifications may occur 
in succeeding generations due to either mutation or environmental influences, such 
progeny may not, in fact, be identical to the parent cell, but are still included within the 
scope of the term as used herein. 

A "chimeric protein" or "fusion protein" is a fusion of a first amino acid sequence 
encoding one of the subject HDx polypeptides with a second amino acid sequence 
defining a domain (e.g., polypeptide portion) foreign to and not substantially homologous 
with any domain of one of the HDx proteins. A chimeric protein may present a foreign 
domain which is found (albeit in a different protein) in an organism which also expresses 
the first protein, or it may be an "interspecies", "intergenic", etc. fusion of protein 
structures expressed by different kinds of organisms. In general, a fusion protein can be 
represented by the general formula X-HDx-Y, wherein HDx represents a portion of the 
protein which is derived from one of the HDx proteins, and X and Y are, independently, 
absent or represent amino acid sequences which are not related to one of the HDx 
sequences in an organism. 

The term "isolated" as also used herein with respect to nucleic acids, such as 
DNA or RNA, refers to molecules separated from other DNAs, or RNAs, respectively, 
that are present in the natural source of the macromolecule. For example, an isolated 
nucleic acid encoding one of the subject polypeptides preferably includes no more 
than 10 kilobases (kb) of nucleic acid sequence which naturally immediately flanks the 
HDx gene in genomic DNA, more preferably no more than 5kb of such naturally 
occurring flanking sequences, and most preferably less than 1.5kb of such naturally 
occurring flanking sequence. The term isolated as used herein also refers to a nucleic 
acid or peptide that is substantially free of cellular material, viral material, or culture 
medium when produced by recombinant DNA techniques, or chemical precursors or 
other chemicals when chemically synthesized. Moreover, an "isolated nucleic acid" is 
meant to include nucleic acid fragments which are not naturally occurring as fragments 
and would not be found in the natural state. 

As described below, one aspect of the invention pertains to isolated nucleic acids 
comprising nucleotide sequences encoding HDx polypeptides, and/or equivalents of such 
nucleic acids. The term nucleic acid as used herein is intended to include fragments as 
equivalents. The term equivalent is understood to include nucleotide sequences encoding 
functionally equivalent HDx polypeptides or functionally equivalent peptides having an 
activity of an HDx protein such as described herein. Equivalent nucleotide sequences will 
include sequences that differ by one or more nucleotide substitutions, additions or 
deletions, such as allelic variants; and will, therefore, include sequences that differ from 
the nucleotide sequence of the HDx cDNA sequences shown in any of SEQ ID Nos:1-4 
due to the degeneracy of the genetic code. Equivalents will also include nucleotide 
sequences that hybridize under stringent conditions (i.e., equivalent to about 20-27°C 
below the melting temperature (Tm) of the DNA duplex formed in about 1M salt) to the 
nucleotide sequences represented in one or more of SEQ ID Nos:1-4. In one 
embodiment, equivalents will further include nucleic acid sequences derived from, and 
evolutionarily related to, a nucleotide sequence shown in any of SEQ ID Nos:1-4. 
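
The stringency window stated above (about 20-27°C below Tm) can be turned into a temperature range once Tm has been estimated. The text does not say how Tm is to be determined; the sketch below assumes a commonly quoted empirical approximation for long DNA duplexes, Tm = 81.5 + 16.6 log10[Na+] + 0.41(%GC) - 500/N, which is usually applied at lower salt concentrations and should therefore be read as a rough estimate at about 1M salt. The function names and the probe sequence are hypothetical.

```python
import math


def estimate_tm(sequence, salt_molar=1.0):
    """Rough Tm (deg C) of a long DNA duplex from GC content, length and salt.

    Uses the empirical approximation named in the accompanying text; this is an
    assumption of the sketch, not a method taken from the specification.
    """
    seq = sequence.upper()
    n = len(seq)
    if n == 0 or salt_molar <= 0:
        raise ValueError("need a non-empty sequence and a positive salt concentration")
    gc_percent = 100.0 * (seq.count("G") + seq.count("C")) / n
    return 81.5 + 16.6 * math.log10(salt_molar) + 0.41 * gc_percent - 500.0 / n


def stringent_window(tm):
    """Hybridization temperatures lying 27 to 20 deg C below the given Tm."""
    return (tm - 27.0, tm - 20.0)


# Hypothetical 200-nucleotide probe with 50% GC content.
probe = "ACGT" * 50
tm = estimate_tm(probe, salt_molar=1.0)
low, high = stringent_window(tm)
print(f"Tm ~ {tm:.1f} C; stringent hybridization at roughly {low:.1f}-{high:.1f} C")
```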

Moreover, it will be generally appreciated that, under certain circumstances, it 
may be advantageous to provide homologs of one of the subject HDx polypeptides which 
function in a limited capacity as one of either an HDx agonist (mimetic) or an HDx 
antagonist, in order to promote or inhibit only a subset of the biological activities of the 
naturally-occurring form of the protein. Thus, specific biological effects can be elicited 
by treatment with a homolog of limited function, and with fewer side effects relative to 
treatment with agonists or antagonists which are directed to all of the biological activities 
of naturally occurring forms of HDx proteins. 

Homologs of each of the subject HDx proteins can be generated by mutagenesis, 
such as by discrete point mutation(s), or by truncation. For instance, mutation can give 
rise to homologs which retain substantially the same, or merely a subset, of the biological 
activity of the HDx polypeptide from which they were derived. Alternatively, antagonistic 
forms of the protein can be generated which are able to inhibit the function of the 
naturally occurring form of the protein, such as by competitively binding to an HDx 
substrate or HDx-associated protein, as for example competing with wild-type HDx in the 
binding of RbAp48 or a histone. In addition, agonistic forms of the protein may be 
generated which are constitutively active, or have an altered Kcat or Km for deacetylation 
reactions. Thus, the HDx protein and homologs thereof provided by the subject 
invention may be either positive or negative regulators of transcription and/or replication. 

In general, polypeptides referred to herein as having an activity of an HDx protein
(e.g., are "bioactive") are defined as polypeptides which include an amino acid sequence
corresponding (e.g., identical or homologous) to all or a portion of the amino acid
sequences of the HDx proteins shown in any one or more of SEQ ID Nos: 5-8 and which
mimic or antagonize all or a portion of the biological/biochemical activities of a naturally
occurring HDx protein. Examples of such biological activity include the ability to
modulate proliferation of cells. For example, inhibiting histone deacetylation causes cells
to arrest in the G1 and G2 phases of the cell cycle. The biochemical activity associated with
HDx proteins of the present invention can also be characterized in terms of binding to and
(optionally) catalyzing the deacetylation of an acetylated histone. Another biochemical
property of certain of the subject HDx proteins involves binding to other cellular
proteins, such as RbAp48 or Sin3A.

Other biological activities of the subject HDx proteins are described herein or will
be reasonably apparent to those skilled in the art. According to the present invention, a
polypeptide has biological activity if it is a specific agonist or antagonist of a
naturally-occurring form of an HDx protein.

Preferred nucleic acids encode an HDx polypeptide comprising an amino acid
sequence at least 80% homologous, more preferably at least 85% homologous and most
preferably at least 88% homologous with an amino acid sequence of a human HDx, e.g.,
such as selected from the group consisting of SEQ ID Nos: 5-8. Nucleic acids which
encode polypeptides at least about 90%, more preferably at least about 95%, and most
preferably at least about 98-99% homologous with an amino acid sequence represented in
one of SEQ ID Nos: 5-8 are of course also within the scope of the invention, as are
nucleic acids identical in sequence with any of the enumerated HDx sequences of the
sequence listing. In one embodiment, the nucleic acid is a cDNA encoding a polypeptide
having at least one activity of the subject HDx polypeptide.
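
The percentage thresholds recited above can be made concrete with a short Python sketch. It assumes that "homologous" is scored as simple percent identity over the non-gap columns of two pre-aligned sequences, which is only one possible reading; the function names and the example peptides are hypothetical and are not drawn from the sequence listing.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned (non-gap) columns where both residues match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip columns that contain a gap
        compared += 1
        matches += a == b
    if compared == 0:
        raise ValueError("no aligned residues to compare")
    return 100.0 * matches / compared

def preference_tier(identity: float) -> str:
    """Map an identity value onto the 80/85/88% thresholds recited in the text."""
    if identity >= 88.0:
        return "most preferred (>= 88%)"
    if identity >= 85.0:
        return "more preferred (>= 85%)"
    if identity >= 80.0:
        return "preferred (>= 80%)"
    return "below 80%"

# Hypothetical pre-aligned peptides (artificial, not from SEQ ID Nos: 5-8).
ref = "ACDEFGHIKLMNPQRSTVWY"
var = "ACDEFGHLKLMNPQKSTVWY"
pid = percent_identity(ref, var)
print(f"{pid:.1f}% identical: {preference_tier(pid)}")
```

A similarity score that also credits conservative substitutions would give higher values than this identity count for the same alignment.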

In certain preferred embodiments, the invention features a purified or recombinant
HDx polypeptide having a peptide chain with a molecular weight in the range of 40 kd to
60 kd, even more preferably in the range of 45-50 kd or 53-58 kd. It will be understood
that certain post-translational modifications, e.g., phosphorylation and the like, can
increase the apparent molecular weight of the HDx protein relative to the unmodified
polypeptide chain, and cleavage of certain sequences, such as pro-sequences, can likewise
decrease the apparent molecular weight.

In other preferred embodiments, the nucleic acid encodes an HDx polypeptide
which includes both the v and x motifs, and preferably possesses a histone deacetylase
activity. For example, preferred HDx proteins are represented by the general formula
A-(v motif)-B-(x motif)-C, wherein the v motif is an amino acid sequence represented in
SEQ ID No. 12, more preferably SEQ ID No. 13, the x motif is an amino acid sequence
represented in SEQ ID No. 14, more preferably SEQ ID No. 15, and A, B and C
represent amino acid sequences which correspond to HDx or HDx-related proteins.

Still other preferred nucleic acids of the present invention encode an HDx
polypeptide which includes a polypeptide sequence corresponding to all or a portion of
amino acid residues of any one of SEQ ID Nos: 5-8, e.g., at least 5, 10, 25, 50 or 100
amino acid residues of that region.

Another aspect of the invention provides a nucleic acid which hybridizes under
high or low stringency conditions to the nucleic acid represented by SEQ ID No: 1.
Appropriate stringency conditions which promote DNA hybridization, for example, 6.0 x
sodium chloride/sodium citrate (SSC) at about 45°C, followed by a wash of 2.0 x SSC at
50°C, are known to those skilled in the art or can be found in Current Protocols in
Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For example, the salt
concentration in the wash step can be selected from a low stringency of about 2.0 x SSC
at 50°C to a high stringency of about 0.2 x SSC at 50°C. In addition, the temperature in
the wash step can be increased from low stringency conditions at room temperature,
about 22°C, to high stringency conditions at about 65°C.
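
To relate the SSC strengths above to salt concentration, the sketch below converts an SSC multiple into total sodium molarity and orders the quoted conditions by stringency. It assumes the usual 1 x SSC recipe (0.15 M NaCl plus 0.015 M trisodium citrate) and uses a deliberately simple ordering heuristic (lower salt first, then higher temperature); the function names are hypothetical and the ordering is not a thermodynamic model.

```python
# Standard 1x SSC is assumed to be 0.15 M NaCl plus 0.015 M trisodium citrate.
NACL_M_PER_X = 0.15
CITRATE_M_PER_X = 0.015

def sodium_molarity(ssc_x: float) -> float:
    """Total Na+ molarity of an SSC solution of the given strength."""
    # Each trisodium citrate contributes three sodium ions.
    return ssc_x * (NACL_M_PER_X + 3 * CITRATE_M_PER_X)

def rank_washes(washes):
    """Order (label, ssc_x, temp_c) tuples from least to most stringent.

    Heuristic only: lower salt is treated as more stringent, and temperature
    breaks ties between washes of equal salt concentration.
    """
    return sorted(washes, key=lambda w: (-sodium_molarity(w[1]), w[2]))

washes = [
    ("high-stringency wash", 0.2, 50.0),
    ("low-stringency wash", 2.0, 50.0),
    ("hybridization", 6.0, 45.0),
]
for label, ssc_x, temp_c in rank_washes(washes):
    print(f"{label}: {ssc_x} x SSC, {temp_c:.0f} C, Na+ ~ {sodium_molarity(ssc_x):.3f} M")
```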

Nucleic acids having a sequence that differs from the nucleotide sequences
shown in one of SEQ ID Nos: 1-4 due to degeneracy in the genetic code are also within
the scope of the invention. Such nucleic acids encode functionally equivalent peptides
(i.e., a peptide having a biological activity of an HDx polypeptide) but differ in sequence
from the sequence shown in the sequence listing due to degeneracy in the genetic code.
For example, a number of amino acids are designated by more than one triplet. Codons
that specify the same amino acid, or synonyms (for example, CAU and CAC each encode
histidine) may result in "silent" mutations which do not affect the amino acid sequence of
an HDx polypeptide. However, it is expected that DNA sequence polymorphisms that do
lead to changes in the amino acid sequences of the subject HDx polypeptides will exist
among, for example, humans. One skilled in the art will appreciate that these variations
in one or more nucleotides (up to about 3-5% of the nucleotides) of the nucleic acids
encoding polypeptides having an activity of an HDx polypeptide may exist among
individuals of a given species due to natural allelic variation.
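
The degeneracy described above can be checked mechanically. The following Python sketch builds the standard genetic code, confirms that CAU and CAC both specify histidine, and tests whether a nucleotide change is silent. The helper names and the short coding sequences are hypothetical examples, not HDx sequences.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code keyed by DNA codon ('*' marks a stop codon).
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon (RNA 'U' is accepted)."""
    dna = dna.upper().replace("U", "T")
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def is_silent(original: str, variant: str) -> bool:
    """True when two equal-length coding sequences encode the same peptide."""
    return len(original) == len(variant) and translate(original) == translate(variant)

def synonymous_codons(amino_acid: str) -> list[str]:
    """All DNA codons that specify the given one-letter amino acid."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)

print(translate("CAU"), translate("CAC"))
print(synonymous_codons("H"))
print(is_silent("ATGCATGGT", "ATGCACGGT"))
```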

As used herein, an HDx gene fragment refers to a nucleic acid having fewer
nucleotides than the nucleotide sequence encoding the entire mature form of an HDx
protein yet which (preferably) encodes a polypeptide which retains some biological
activity of the full length protein. Fragment sizes contemplated by the present invention
include, for example, 5, 10, 25, 50, 75, 100, or 200 amino acids in length.

As indicated by the examples set out below, HDx protein-encoding nucleic acids
can be obtained from mRNA present in any of a number of eukaryotic cells. It should
also be possible to obtain nucleic acids encoding HDx polypeptides of the present
invention from genomic DNA from both adults and embryos. For example, a gene
encoding an HDx protein can be cloned from either a cDNA or a genomic library in
accordance with protocols described herein, as well as those generally known to persons
skilled in the art. A cDNA encoding an HDx protein can be obtained by isolating total
mRNA from a cell, e.g. a mammalian cell, e.g. a human cell, including embryonic cells.
Double stranded cDNAs can then be prepared from the total mRNA, and subsequently
inserted into a suitable plasmid or bacteriophage vector using any one of a number of
known techniques. The gene encoding an HDx protein can also be cloned using
established polymerase chain reaction techniques in accordance with the nucleotide
sequence information provided by the invention. [...]

Another aspect of the invention relates to the use of the isolated nucleic acid in
"antisense" therapy. As used herein, "antisense" therapy refers to administration or in
situ generation of oligonucleotide probes or their derivatives which specifically hybridize
(e.g. bind) under cellular conditions with the cellular mRNA and/or genomic DNA
encoding one or more of the subject HDx proteins so as to inhibit expression of that
protein, e.g. by inhibiting transcription and/or translation. The binding may be by
conventional base pair complementarity, or, for example, in the case of binding to DNA
duplexes, through specific interactions in the major groove of the double helix. In
general, "antisense" therapy refers to the range of techniques generally employed in the
art, and includes any therapy which relies on specific binding to oligonucleotide
sequences.

An antisense construct of the present invention can be delivered, for example, as
an expression plasmid which, when transcribed in the cell, produces RNA which is
complementary to at least a unique portion of the cellular mRNA which encodes an HDx
protein. Alternatively, the antisense construct is an oligonucleotide probe which is
generated ex vivo and which, when introduced into the cell, causes inhibition of
expression by hybridizing with the mRNA and/or genomic sequences of an HDx gene.
Such oligonucleotide probes are preferably modified oligonucleotides which are resistant
to endogenous nucleases, e.g. exonucleases and/or endonucleases, and are therefore
stable in vivo. Exemplary nucleic acid molecules for use as antisense oligonucleotides are
phosphoramidate, phosphothioate and methylphosphonate analogs of DNA (see also U.S.
Patents [...]) or peptide nucleic acids (PNAs).
Additionally, general approaches to constructing oligomers useful in antisense therapy
have been reviewed, for example, by Van der Krol et al. (1988) Biotechniques 6:958-976;
and Stein et al. (1988) Cancer Res 48:2659-2668.
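
The base-pair complementarity on which an antisense oligonucleotide relies can be sketched in a few lines of Python. The example computes the reverse complement of a target mRNA segment and checks that the resulting oligomer pairs with it. The function names and the 18-nucleotide target are hypothetical; the target is an artificial sequence and is not taken from any HDx gene.

```python
COMPLEMENT = str.maketrans("ACGU", "UGCA")

def antisense_rna(mrna_segment: str) -> str:
    """Return the RNA antisense strand (5'->3') for an mRNA segment (5'->3')."""
    seq = mrna_segment.upper().replace("T", "U")
    if set(seq) - set("ACGU"):
        raise ValueError("sequence contains non-nucleotide characters")
    # Complement every base, then reverse so the result also reads 5'->3'.
    return seq.translate(COMPLEMENT)[::-1]

def pairs_with(oligo: str, target: str) -> bool:
    """Check full Watson-Crick complementarity between an oligo and its target."""
    return antisense_rna(target) == oligo.upper().replace("T", "U")

# Hypothetical 18-nt target region spanning a start codon (artificial sequence).
target = "GCCACCAUGGAUCUGAAA"
oligo = antisense_rna(target)
print(oligo)
print(pairs_with(oligo, target))
```

This covers only sequence complementarity; backbone modifications such as the analogs listed above affect nuclease resistance and are outside what the sketch models.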

Accordingly, the modified oligomers of the invention are useful in therapeutic,
diagnostic, and research contexts. In therapeutic applications, the oligomers are utilized
in a manner appropriate for antisense therapy in general. For such therapy, the oligomers
of the invention can be formulated for a variety of routes of administration, including
systemic and topical or localized administration. Techniques and formulations generally
may be found in Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton,
PA. For systemic administration, injection is preferred, including intramuscular,
intravenous, intraperitoneal, and subcutaneous. For injection, the oligomers of the
invention can be formulated in liquid solutions, preferably in physiologically compatible
buffers such as Hank's solution or Ringer's solution. In addition, the oligomers may be
formulated in solid form and redissolved or suspended immediately prior to use.
Lyophilized forms are also included.

Systemic administration can also be by transmucosal or transdermal means, or the
compounds can be administered orally. For transmucosal or transdermal administration,
penetrants appropriate to the barrier to be permeated are used in the formulation. Such
penetrants are generally known in the art, and include, for example, for transmucosal
administration bile salts and fusidic acid derivatives. In addition, detergents may be used
to facilitate permeation. Transmucosal administration may be through nasal sprays or
using suppositories. For oral administration, the oligomers are formulated into
conventional oral administration forms such as capsules, tablets, and tonics. For topical
administration, the oligomers of the invention are formulated into ointments, salves, gels,
or creams as generally known in the art.

In addition to use in therapy, the oligomers of the invention may be used as
diagnostic reagents to detect the presence or absence of the target DNA or RNA
sequences to which they specifically bind. Such diagnostic tests are described in further
detail below.

Likewise, the antisense constructs of the present invention, by antagonizing the
normal biological activity of one of the HDx proteins, can be used in the manipulation of
tissue, e.g. tissue differentiation or growth, both in vivo and ex vivo.

Furthermore, the anti-sense techniques (e.g. microinjection of antisense
molecules, or transfection with plasmids whose transcripts are anti-sense with regard to
an HDx mRNA or gene sequence) can be used to investigate the role of HDx in
developmental events, as well as the normal cellular function of HDx in adult tissue.
Such techniques can be utilized in cell culture, but can also be used in the creation of
transgenic animals (described infra).

This invention also provides expression vectors containing a nucleic acid
encoding an HDx polypeptide, operably linked to at least one transcriptional regulatory
sequence. Operably linked is intended to mean that the nucleotide sequence is linked to a
regulatory sequence in a manner which allows expression of the nucleotide sequence.
Regulatory sequences are art-recognized and are selected to direct expression of the
subject HDx proteins. Accordingly, the term transcriptional regulatory sequence includes
promoters, enhancers and other expression control elements. Such regulatory sequences
are described in Goeddel; Gene Expression Technology: Methods in Enzymology 185,
Academic Press, San Diego, CA (1990). For instance, any of a wide variety of
expression control sequences, sequences that control the expression of a DNA sequence
when operatively linked to it, may be used in these vectors to express DNA sequences
encoding HDx polypeptides of this invention. Such useful expression control sequences
include, for example, a viral LTR, such as the LTR of the Moloney murine leukemia
virus, the early and late promoters of SV40, adenovirus or cytomegalovirus immediate
early promoter, the lac system, the trp system, the TAC or TRC system, T7 promoter
whose expression is directed by T7 RNA polymerase, the major operator and promoter
regions of phage λ, the control regions for fd coat protein, the promoter for
3-phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase,
e.g., Pho5, the promoters of the yeast α-mating factors, the polyhedron promoter of the
baculovirus system and other sequences known to control the expression of genes of
prokaryotic or eukaryotic cells or their viruses, and various combinations thereof. It
should be understood that the design of the expression vector may depend on such
factors as the choice of the host cell to be transformed and/or the type of protein desired
to be expressed. Moreover, the vector's copy number, the ability to control that copy
number and the expression of any other proteins encoded by the vector, such as antibiotic
markers, should also be considered. In one embodiment, the expression vector includes a
recombinant gene encoding a peptide having an agonistic activity of a subject HDx
polypeptide, or alternatively, encoding a peptide which is an antagonistic form of the
HDx protein, such as a catalytically-inactive deacetylase. Such expression vectors can be
used to transfect cells and thereby produce polypeptides, including fusion proteins,
encoded by nucleic acids as described herein.

Moreover, the gene constructs of the present invention can also be used as a part
of a gene therapy protocol to deliver nucleic acids, e.g., encoding either an agonistic or
antagonistic form of one of the subject HDx proteins or an antisense molecule described
above. Thus, another aspect of the invention features expression vectors for in vivo or in
vitro transfection and expression of an HDx polypeptide or antisense molecule in
particular cell types so as to reconstitute the function of, or alternatively, abrogate the
function of HDx-induced transcription in a tissue in which the naturally-occurring form of
the protein is misexpressed; or to deliver a form of the protein which alters differentiation
of tissue, or which inhibits neoplastic transformation.

Expression constructs of the subject HDx polypeptides, as well as antisense
constructs, may be administered in any biologically effective carrier, e.g. any formulation
or composition capable of effectively delivering the recombinant gene to cells in vivo.
Approaches include insertion of the subject gene in viral vectors including recombinant
retroviruses, adenovirus, adeno-associated virus, and herpes simplex virus-1, or
recombinant bacterial or eukaryotic plasmids. Viral vectors transfect cells directly;
plasmid DNA can be delivered with the help of, for example, cationic liposomes
(lipofectin) or derivatized (e.g. antibody conjugated), polylysine conjugates, gramicidin
S, artificial viral envelopes or other such intracellular carriers, as well as direct injection
of the gene construct or CaPO4 precipitation carried out in vivo. It will be appreciated
that because transduction of appropriate target cells represents the critical first step in
gene therapy, choice of the particular gene delivery system will depend on such factors as
the phenotype of the intended target and the route of administration, e.g. locally or
systemically. Furthermore, it will be recognized that the particular gene constructs
provided for in vivo transduction of HDx expression are also useful for in vitro
transduction of cells, such as for use in the ex vivo tissue culture systems described
below.

A preferred approach for in vivo introduction of nucleic acid into a cell is by use
of a viral vector containing nucleic acid, e.g. a cDNA encoding the particular HDx
polypeptide desired. Infection of cells with a viral vector has the advantage that a large
proportion of the targeted cells can receive the nucleic acid. Additionally, molecules
encoded within the viral vector, e.g., by a cDNA contained in the viral vector, are
expressed efficiently in cells which have taken up viral vector nucleic acid. Retrovirus
vectors, adenovirus vectors and adeno-associated virus vectors are exemplary
recombinant gene delivery systems for the transfer of exogenous genes in vivo,
particularly into humans. These vectors provide efficient delivery of genes into cells, and
the transferred nucleic acids are stably integrated into the chromosomal DNA of the host.

In addition to viral transfer methods, such as those illustrated above, non-viral
methods can also be employed to cause expression of a subject HDx polypeptide in the
tissue of an animal. Most nonviral methods of gene transfer rely on normal mechanisms
used by mammalian cells for the uptake and intracellular transport of macromolecules. In
preferred embodiments, non-viral gene delivery systems of the present invention rely on
endocytic pathways for the uptake of the subject HDx polypeptide gene by the targeted
cell. Exemplary gene delivery systems of this type include liposomal derived systems,
poly-lysine conjugates, and artificial viral envelopes.

In clinical settings, the gene delivery systems for the therapeutic HDx gene can be
introduced into a patient by any of a number of methods, each of which is familiar in the
art. For instance, a pharmaceutical preparation of the gene delivery system can be
introduced systemically, e.g. by intravenous injection, and specific transduction of the
protein in the target cells occurs predominantly from specificity of transfection provided
by the gene delivery vehicle, cell-type or tissue-type expression due to the transcriptional
regulatory sequences controlling expression of the receptor gene, or a combination
thereof. In other embodiments, initial delivery of the recombinant gene is more limited
with introduction into the animal being quite localized. For example, the gene delivery
vehicle can be introduced by catheter (see U.S. Patent 5,328,470) or by stereotactic
injection (e.g. Chen et al. (1994) PNAS 91: 3054-3057). An HDx gene, such as any one
of the clones represented in the group consisting of SEQ ID Nos: 1-4, can be delivered in
a gene therapy construct by electroporation using techniques described, for example, by
Dev et al. ((1994) Cancer Treat Rev 20:105-115).

The pharmaceutical preparation of the gene therapy construct can consist
essentially of the gene delivery system in an acceptable diluent, or can comprise a slow
release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the
complete gene delivery system can be produced intact from recombinant cells, e.g.
retroviral vectors, the pharmaceutical preparation can comprise one or more cells which
produce the gene delivery system.

Another aspect of the present invention concerns recombinant forms of the HDx
proteins. Recombinant polypeptides preferred by the present invention, in addition to
native HDx proteins, are at least 80% homologous, more preferably at least 85%
homologous and most preferably at least 88% homologous with an amino acid sequence
represented by any of SEQ ID Nos: 5-8. Polypeptides which possess an activity of an
HDx protein (i.e. either agonistic or antagonistic), and which are at least 90%, more
preferably at least 95%, and most preferably at least about 98-99% homologous with a
sequence selected from the group consisting of SEQ ID Nos: 5-8 are also within the
scope of the invention. In other preferred embodiments, the HDx polypeptide includes
both the v and x motifs, and preferably possesses a histone deacetylase activity.

The term "recombinant HDx protein" refers to a polypeptide which is produced
by recombinant DNA techniques, wherein generally, DNA encoding an HDx polypeptide
is inserted into a suitable expression vector which is in turn used to transform a host cell
to produce the heterologous protein. Moreover, the phrase "derived from", with respect
to a recombinant HDx gene, is meant to include within the meaning of "recombinant
protein" those proteins having an amino acid sequence of a native HDx protein, or an
amino acid sequence similar thereto which is generated by mutations including
substitutions and deletions (including truncation) of a naturally occurring form of the
protein.

The present invention further pertains to recombinant forms of the subject HDx
polypeptides which are encoded by genes derived from a mammal (e.g. a human), and
which have amino acid sequences evolutionarily related to the HDx proteins represented
in SEQ ID Nos: 5-8. Such recombinant HDx polypeptides preferably are capable of
functioning in one of either role of an agonist or antagonist of at least one biological
activity of a wild-type ("authentic") HDx protein of the appended sequence listing. The
term "evolutionarily related to", with respect to amino acid sequences of HDx proteins,
refers to both polypeptides having amino acid sequences which have arisen naturally, and
also to mutational variants of HDx polypeptides which are derived, for example, by
combinatorial mutagenesis.

The present invention also provides methods of producing the subject HDx
polypeptides. For example, a host cell transfected with a nucleic acid vector directing
expression of a nucleotide sequence encoding the subject polypeptides can be cultured
under appropriate conditions to allow expression of the peptide to occur. The cells may
be harvested, lysed and the protein isolated. A cell culture includes host cells, media and
other byproducts. Suitable media for cell culture are well known in the art. The
recombinant HDx polypeptide can be isolated from cell culture medium, host cells, or
both using techniques known in the art for purifying proteins including ion-exchange
chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, and
immunoaffinity purification with antibodies specific for such peptide. In a preferred
embodiment, the recombinant HDx polypeptide is a fusion protein containing a domain
which facilitates its purification, such as GST fusion protein or poly(His) fusion protein.

This invention also pertains to a host cell transfected to express recombinant
forms of the subject HDx polypeptides. The host cell may be any prokaryotic or
eukaryotic cell. Thus, a nucleotide sequence derived from the cloning of HDx proteins,
encoding all or a selected portion of a full-length protein, can be used to produce a
recombinant form of an HDx polypeptide via microbial or eukaryotic cellular processes.
Ligating the polynucleotide sequence into a gene construct, such as an expression vector,
and transforming or transfecting into hosts, either eukaryotic (yeast, avian, insect or
mammalian) or prokaryotic (bacterial cells), are standard procedures used in producing
other well-known proteins, e.g. MAP kinases, p53, WT1, PTP phosphatases, SRC, and
the like. Similar procedures, or modifications thereof, can be employed to prepare
recombinant HDx polypeptides by microbial means or tissue-culture technology in accord
with the subject invention.

The recombinant HDx genes can be produced by ligating nucleic acid encoding an
HDx protein, or a portion thereof, into a vector suitable for expression in either
prokaryotic cells, eukaryotic cells, or both. Expression vectors for production of
recombinant forms of the subject HDx polypeptides include plasmids and other vectors.
For instance, suitable vectors for the expression of an HDx polypeptide include plasmids
of the types: pBR322-derived plasmids, pEMBL-derived plasmids, pEX-derived
plasmids, pBTac-derived plasmids and pUC-derived plasmids for expression in
prokaryotic cells, such as E. coli.

A number of vectors exist for the expression of recombinant proteins in yeast.
For instance, YEP24, YIP5, YEP51, YEP52, pYES2, and YRP17 are cloning and
expression vehicles useful in the introduction of genetic constructs into S. cerevisiae (see,
for example, Broach et al. (1983) in Experimental Manipulation of Gene Expression,
ed. M. Inouye, Academic Press, p. 83, incorporated by reference herein). These vectors
can replicate in E. coli due to the presence of the pBR322 ori, and in S. cerevisiae due to
the replication determinant of the yeast 2 micron plasmid. In addition, drug resistance
markers such as ampicillin can be used. In an illustrative embodiment, an HDx
polypeptide is produced recombinantly utilizing an expression vector generated by
sub-cloning the coding sequence of one of the HDx genes represented in SEQ ID Nos: 1-4.

The preferred mammalian expression vectors contain both prokaryotic sequences,
to facilitate the propagation of the vector in bacteria, and one or more eukaryotic
transcription units that are expressed in eukaryotic cells. The pcDNAI/amp,
pcDNAI/neo, pRc/CMV, pSV2gpt, pSV2neo, pSV2-dhfr, pTk2, pRSVneo, pMSG,
pSVT7, pko-neo and pHyg derived vectors are examples of mammalian expression
vectors suitable for transfection of eukaryotic cells. Some of these vectors are modified
with sequences from bacterial plasmids, such as pBR322, to facilitate replication and
drug resistance selection in both prokaryotic and eukaryotic cells. Alternatively,
derivatives of viruses such as the bovine papillomavirus (BPV-1), or Epstein-Barr virus
(pHEBo, pREP-derived and p205) can be used for transient expression of proteins in
eukaryotic cells. The various methods employed in the preparation of the plasmids and
transformation of host organisms are well known in the art. For other suitable expression
systems for both prokaryotic and eukaryotic cells, as well as general recombinant
procedures, see Molecular Cloning: A Laboratory Manual, 2nd Ed., ed. by Sambrook,
Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989) Chapters 16 and 17.

In some instances, it may be desirable to express the recombinant HDx
polypeptide by the use of a baculovirus expression system. Examples of such baculovirus
expression systems include pVL-derived vectors (such as pVL1392, pVL1393 and
pVL941), pAcUW-derived vectors (such as pAcUW1), and pBlueBac-derived vectors
(such as the β-gal containing pBlueBac III).

When it is desirable to express only a portion of an HDx protein, such as a form
lacking a portion of the N-terminus, i.e. a truncation mutant which lacks the signal
peptide, it may be necessary to add a start codon (ATG) to the oligonucleotide fragment
containing the desired sequence to be expressed. It is well known in the art that a
methionine at the N-terminal position can be enzymatically cleaved by the use of the
enzyme methionine aminopeptidase (MAP). MAP has been cloned from E. coli
(Ben-Bassat et al. (1987) J. Bacteriol. 169:751-757) and Salmonella typhimurium and its in
vitro activity has been demonstrated on recombinant proteins (Miller et al. (1987) PNAS
84:2718-2722). Therefore, removal of an N-terminal methionine, if desired, can be
achieved either in vivo by expressing HDx-derived polypeptides in a host which produces
MAP (e.g., E. coli or CM89 or S. cerevisiae), or in vitro by use of purified MAP (e.g.,
procedure of Miller et al., supra).
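
The start-codon step described above can be illustrated with a minimal Python sketch. It assumes the truncated fragment begins in frame (its first base is the first position of a codon) and simply prepends ATG when none is present. The function name and the example fragment are hypothetical and are not taken from SEQ ID Nos: 1-4.

```python
START_CODON = "ATG"

def add_start_codon(fragment: str) -> str:
    """Prepend ATG to a coding fragment unless it already begins with one.

    The fragment is assumed to start in frame, so its length must be a
    multiple of three for the reading frame to be preserved.
    """
    frag = fragment.upper()
    if len(frag) % 3 != 0:
        raise ValueError("fragment length must be a multiple of 3 to stay in frame")
    return frag if frag.startswith(START_CODON) else START_CODON + frag

# Hypothetical truncated coding fragment (artificial sequence).
truncated = "GGTAAACTGGAA"
print(add_start_codon(truncated))
```

The added codon introduces an N-terminal methionine, which is the residue that methionine aminopeptidase may subsequently remove.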

Alternatively, the coding sequences for the polypeptide can be incorporated as a
part of a fusion gene including a nucleotide sequence encoding a different polypeptide.
This type of expression system can be useful under conditions where it is desirable to
produce an immunogenic fragment of an HDx protein. For example, the VP6 capsid
protein of rotavirus can be used as an immunologic carrier protein for portions of the
HDx polypeptide, either in the monomeric form or in the form of a viral particle. The
nucleic acid sequences corresponding to the portion of a subject HDx protein to which
antibodies are to be raised can be incorporated into a fusion gene construct which
includes coding sequences for a late vaccinia virus structural protein to produce a set of
recombinant viruses expressing fusion proteins comprising HDx epitopes as part of the
virion. It has been demonstrated with the use of immunogenic fusion proteins utilizing
the Hepatitis B surface antigen fusion proteins that recombinant Hepatitis B virions can
be utilized in this role as well. Similarly, chimeric constructs coding for fusion proteins
containing a portion of an HDx protein and the poliovirus capsid protein can be created
to enhance immunogenicity of the set of polypeptide antigens (see, for example, EP
Publication No: 0259149; and Evans et al. (1989) Nature 339:385; Huang et al. (1988)
J. Virol. 62:3855; and Schlienger et al. (1992) J. Virol. 66:2).

The Multiple Antigen Peptide system for peptide-based immunization can also be
utilized to generate an immunogen, wherein a desired portion of an HDx polypeptide is
obtained directly from organo-chemical synthesis of the peptide onto an oligomeric
branching lysine core (see, for example, Posnett et al. (1988) JBC 263:1719 and Nardelli
et al. (1992) J. Immunol. 148:914). Antigenic determinants of HDx proteins can also be
expressed and presented by bacterial cells.

In addition to utilizing fusion proteins to enhance immunogenicity, it is widely
appreciated that fusion proteins can also facilitate the expression of proteins and,
accordingly, can be used in the expression of the HDx polypeptides of the present
invention. For example, HDx polypeptides can be generated as glutathione-S-transferase
(GST-fusion) proteins. Such GST-fusion proteins can enable easy purification of the
HDx polypeptide, as for example by the use of glutathione-derivatized matrices (see, for
example, Current Protocols in Molecular Biology, eds. Ausubel et al. (N.Y.: John Wiley
& Sons, 1991)).

In another embodiment, a fiision gene coding for a purification leader sequence 
such as a poly-(His)/enterokinase cleavage site sequence at the N-terminus of the desired 
portion of the recombinant protein, can allow purification of the expressed fiision protein 
by affimty chromatography using a Ni2^ metal resin. The purification leader sequence 
can then be subsequently removed by treatment with enterolcinase to provide the purified 
V^^zll91^ ^''''^ ^ 177; and Janknecht et ai. 
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Techniques for making fusion genes are known to those skilled in the art. 
Essentially, the joining of various DNA fragments coding for different polypeptide 
sequences is performed in accordance with conventional techniques, employing blunt- 
ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase 
treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment, 
the fusion gene can be synthesized by conventional techniques including automated DNA 
synthesizers. Alternatively, PCR amplification of gene fragments can be carried out using 
anchor primers which give rise to complementary overhangs between two consecutive 
gene fragments which can subsequently be annealed to generate a chimeric gene sequence 
(see, for example, Current Protocols in Molecular Biology, eds. Ausubel et al. John 
Wiley & Sons: 1992). 

HDx polypeptides may also be chemically modified to create HDx derivatives by 
forming covalent or aggregate conjugates with other chemical moieties, such as glycosyl 
groups, lipids, phosphate, acetyl groups and the like. Covalent derivatives of HDx 
proteins can be prepared by linking the chemical moieties to functional groups on amino 
acid sidechains of the protein or at the N-terminus or at the C-terminus of the 
polypeptide. 

The present invention also makes available isolated HDx polypeptides which are 
isolated from, or otherwise substantially free of other cellular proteins, especially other 
signal transduction factors and/or transcription factors which may normally be associated 
with the HDx polypeptide. The terms "substantially free of other cellular proteins" (also 
referred to herein as "contaminating proteins") and "substantially pure or purified 
preparations" are defined as encompassing preparations of HDx polypeptides having less 
than 20% (by dry weight) contaminating protein, and preferably having less than 5% 
contaminating protein. Functional forms of the subject polypeptides can be prepared, for 
the first time, as purified preparations by using a cloned gene as described herein. By 
"purified", it is meant, when referring to a peptide or DNA or RNA sequence, that the 
indicated molecule is present in the substantial absence of other biological 
macromolecules, such as other proteins. The term "purified" as used herein preferably 
means at least 80% by dry weight, more preferably in the range of 95-99% by weight, 
and most preferably at least 99.8% by weight, of biological macromolecules of the same 
type present (but water, buffers, and other small molecules, especially molecules having a 
molecular weight of less than 5000, can be present). The term "pure" as used herein 
preferably has the same numerical limits as "purified" immediately above. "Isolated" and 
"purified" do not encompass either natural materials in their native state or natural 
materials that have been separated into components (e.g., in an acrylamide gel) but not 
obtained either as pure (e.g. lacking contaminating proteins, or chromatography reagents 
such as denaturing agents and polymers, e.g. acrylamide or agarose) substances or 
solutions. In preferred embodiments, purified HDx preparations will lack any 
contaminating proteins from the same animal from which that HDx is normally produced, as can 
be accomplished by recombinant expression of, for example, a human HDx protein in a 
non-human cell. 

As described above for recombinant polypeptides, isolated HDx polypeptides can 
include all or a portion of an amino acid sequence corresponding to an HDx polypeptide 
represented in any one of SEQ ID Nos: 5-8 or homologous sequences thereto. In 
preferred embodiments, the HDx polypeptide includes both the v and x motifs, and 
preferably possesses a histone deacetylase activity. 

Isolated peptidyl portions of HDx proteins can be obtained by screening peptides 
recombinantly produced from the corresponding fragment of the nucleic acid encoding 
such peptides. In addition, fragments can be chemically synthesized using techniques 
known in the art such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. 
For example, an HDx polypeptide of the present invention may be arbitrarily divided into 
fragments of desired length with no overlap of the fragments, or preferably divided into 
overlapping fragments of a desired length. The fragments can be produced 
(recombinantly or by chemical synthesis) and tested to identify those peptidyl fragments 
which can function as either agonists or antagonists of a wild-type (e.g., "authentic") 
HDx protein. 
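
By way of illustration only, the division of a polypeptide into non-overlapping or overlapping fragments described above can be sketched as follows. The function name, the fragment length, the overlap and the placeholder sequence are hypothetical choices made for this sketch and are not taken from the specification.

```python
def split_into_fragments(sequence, length, overlap=0):
    """Divide a one-letter amino acid sequence into fragments.

    overlap=0 gives fragments with no overlap; overlap>0 gives
    overlapping fragments that share `overlap` residues with the
    preceding fragment. The last fragment may be shorter than `length`.
    """
    if length <= 0:
        raise ValueError("length must be positive")
    if not 0 <= overlap < length:
        raise ValueError("overlap must satisfy 0 <= overlap < length")

    step = length - overlap
    fragments = []
    for start in range(0, len(sequence), step):
        fragments.append(sequence[start:start + length])
        # Stop once a fragment has reached the end of the sequence.
        if start + length >= len(sequence):
            break
    return fragments


# Placeholder sequence for illustration only (not an HDx sequence).
example = "ABCDEFGHIJ"
non_overlapping = split_into_fragments(example, 4)
overlapping = split_into_fragments(example, 4, overlap=2)
```

Each fragment in the resulting list would then be produced and tested individually for agonist or antagonist activity.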

The recombinant HDx polypeptides of the present invention also include 
homologs of the authentic HDx proteins, such as versions of those proteins which are 
resistant to proteolytic cleavage, as for example, due to mutations which alter 
ubiquitination or other enzymatic targeting associated with the protein. 

Modification of the structure of the subject HDx polypeptides can be for such 
purposes as enhancing therapeutic or prophylactic efficacy, stability (e.g., ex vivo shelf 
life and resistance to proteolytic degradation in vivo), or post-translational modifications 
(e.g., to alter phosphorylation pattern of protein). Such modified peptides, when 
designed to retain at least one activity of the naturally-occurring form of the protein, or 
to produce specific antagonists thereof, are considered functional equivalents of the HDx 
polypeptides described in more detail herein. Such modified peptides can be produced, 
for instance, by amino acid substitution, deletion, or addition. 

For example, it is reasonable to expect that an isolated replacement of a leucine 
with an isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a 
similar replacement of an amino acid with a structurally related amino acid (i.e. isosteric 
and/or isoelectric mutations) will not have a major effect on the biological activity of the 
resulting molecule. Conservative replacements are those that take place within a family of 
amino acids that are related in their side chains. Genetically encoded amino acids can 
be divided into four families: (1) acidic = aspartate, glutamate; (2) basic = lysine, 
arginine, histidine; (3) nonpolar = alanine, valine, leucine, isoleucine, proline, 
phenylalanine, methionine, tryptophan; and (4) uncharged polar = glycine, asparagine, 
glutamine, cysteine, serine, threonine, tyrosine. Phenylalanine, tryptophan, and tyrosine 
are sometimes classified jointly as aromatic amino acids. In similar fashion, the amino 
acid repertoire can be grouped as (1) acidic = aspartate, glutamate; (2) basic = lysine, 
arginine, histidine; (3) aliphatic = glycine, alanine, valine, leucine, isoleucine, serine, 
threonine, with serine and threonine optionally grouped separately as aliphatic-hydroxyl; 
(4) aromatic = phenylalanine, tyrosine, tryptophan; (5) amide = asparagine, 
glutamine; and (6) sulfur-containing = cysteine and methionine (see, for example, 
Biochemistry, 2nd ed., Ed. by L. Stryer, WH Freeman and Co.: 1981). Whether a 
change in the amino acid sequence of a peptide results in a functional HDx homolog (e.g. 
functional in the sense that the resulting polypeptide mimics or antagonizes the wild-type 
form) can be readily determined by assessing the ability of the variant peptide to produce 
a response in cells in a fashion similar to the wild-type protein, or competitively inhibit 
such a response. Polypeptides in which more than one replacement has taken place can 
readily be tested in the same manner. 
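
As a minimal sketch of the four-family grouping recited above, a replacement can be flagged as conservative when both residues fall in the same family. The dictionary and function names are hypothetical, and the check reflects only the side-chain families listed in the text; it does not predict whether a given variant retains activity, which the text says is determined by assay.

```python
# The four side-chain families recited in the text (three-letter codes).
FAMILIES = {
    "acidic": {"Asp", "Glu"},
    "basic": {"Lys", "Arg", "His"},
    "nonpolar": {"Ala", "Val", "Leu", "Ile", "Pro", "Phe", "Met", "Trp"},
    "uncharged polar": {"Gly", "Asn", "Gln", "Cys", "Ser", "Thr", "Tyr"},
}


def family_of(residue):
    """Return the name of the family containing the residue."""
    for name, members in FAMILIES.items():
        if residue in members:
            return name
    raise ValueError(f"unknown residue: {residue}")


def is_conservative(original, replacement):
    """True when both residues belong to the same side-chain family."""
    return family_of(original) == family_of(replacement)


# The replacements named in the text all stay within one family.
examples = [("Leu", "Ile"), ("Leu", "Val"), ("Asp", "Glu"), ("Thr", "Ser")]
results = [is_conservative(a, b) for a, b in examples]
```

Under the alternative six-group scheme the family table would simply be swapped; the comparison itself is unchanged.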

This invention further contemplates a method for generating sets of combinatorial 
mutants of the subject HDx proteins as well as truncation mutants, and is especially 
useful for identifying potential variant sequences (e.g. homologs) that are functional in 
modulating histone deacetylation. The purpose of screening such combinatorial libraries 
is to generate, for example, novel HDx homologs which can act as either agonists or 
antagonists, or alternatively, possess novel activities altogether. To illustrate, HDx 
homologs can be engineered by the present method to provide selective, constitutive 
activation of enzymatic activity. Thus, combinatorially-derived homologs can be 
generated to have an increased potency relative to a naturally occurring form of the 
protein. 

Likewise, HDx homologs can be generated by the present combinatorial approach 
to selectively inhibit (antagonize) histone deacetylation. For instance, mutagenesis can 
provide HDx homologs which are able to bind other regulatory proteins or cytoskeletal 
elements (or DNA) yet prevent acetylation of histones, e.g. the homologs can be 
dominant negative mutants. In a preferred embodiment, a dominant negative mutant of 
an HDx protein is mutated at one or more residues of its catalytic site and/or specificity 
subsites. 

In one aspect of this method, the amino acid sequences for a population of HDx 
homologs or other related proteins are aligned, preferably to promote the highest 
homology possible. Such a population of variants can include, for example, HDx 
homologs from one or more species. Amino acids which appear at each position of the 
aligned sequences are selected to create a degenerate set of combinatorial sequences. In 
a preferred embodiment, the variegated library of HDx variants is generated by 
combinatorial mutagenesis at the nucleic acid level, and is encoded by a variegated gene 
library. For instance, a mixture of synthetic oligonucleotides can be enzymatically ligated 
into gene sequences such that the degenerate set of potential HDx sequences are 
expressible as individual polypeptides, or alternatively, as a set of larger fusion proteins 
(e.g. for phage display) containing the set of HDx sequences therein. 

As illustrated in Figure 5B, to analyze the sequences of a population of variants, 
the amino acid sequences of interest can be aligned relative to sequence homology. The 
presence or absence of amino acids from an aligned sequence of a particular variant is 
relative to a chosen consensus length of a reference sequence, which can be real or 
artificial. For instance, Figure 5B includes the alignment of the v and x-motifs for several 
of the HDx gene products. Analysis of the alignment of these sequences from the HDx 
clones can give rise to the generation of a degenerate library of polypeptides comprising 
potential HDx sequences. In an exemplary embodiment, a library of variants based on 
the HD1 sequence, but degenerate across each of the v and x-motifs can be provided. 
One such library can be represented by the general formula A-(v motif)-B-(x motif)-C, 
wherein the v motif is an amino acid sequence represented in the general formula 

DIAX1NWAGGLHHAKKX2EASGFCYVNDIVX3X4ILELLKY[...]X7TDRVMTVSF 

the x motif is an amino acid sequence represented in the general formula 

CVEX8VKX9FNX10P[...]X11LGGGGYTX12RNVARCWTYET 

A corresponds to Met1-Thr129 of SEQ ID No. 5, B corresponds to His199-Lys283 of 
SEQ ID No. 5, and C corresponds to Ala317-Ala482 of SEQ ID No. 5, wherein X1 
represents Ile or Val; X2 represents Phe or Ser; X3 represents Phe or Leu; X4 represents 
Gly or Ala; X5 represents Pro or Gln; X6 represents Gln or Glu; X7 represents Leu or 
Thr; X8 represents Val or Tyr; X9 represents Thr or Ser; X10 represents Leu or Ile; X11 
represents Met or Val; and X12 represents Ile or Val. To further expand the 
combinatorial set, other conservative mutations relative to those appearing in the human 
sequences can be provided. For example, in a more expansive library, X1 represents Gly, 
Ala, Val, Ile or Leu; X2 represents Phe, Tyr, Thr or Ser; X3 represents Phe, Tyr, Gly, 
Ala, Val, Ile or Leu; X4 represents Gly, Ala, Val, Ile or Leu; X5 represents Pro, Asn or 
Gln; X6 represents Asn, Gln, Asp or Glu; X7 represents Gly, Ala, Val, Ile, Leu, Ser or 
Thr; X8 represents Gly, Ala, Val, Ile, Leu, Phe or Tyr; X9 represents Thr, Cys, or Ser; 
X10 represents Gly, Ala, Val, Ile or Leu; X11 represents Met, Cys, Gly, Ala, Val, Ile, 
Leu, Ser or Thr; and X12 represents Gly, Ala, Val, Ile or Leu. In still another library, 
each degenerate position can be any one of the naturally occurring amino acids. 
Likewise, the v and x-motifs can correspond to the degenerate sequences designated by 
SEQ ID Nos. 12 and 14, respectively. 
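
To give a sense of scale, the number of distinct polypeptides in each of the three libraries described above can be estimated by multiplying the number of residues permitted at each degenerate position. The sketch below assumes twelve independently varied positions, each occurring once in the formula; the variable and function names are hypothetical.

```python
from math import prod

# Residues permitted at each degenerate position in the restricted library.
RESTRICTED = {
    "X1": ["Ile", "Val"], "X2": ["Phe", "Ser"], "X3": ["Phe", "Leu"],
    "X4": ["Gly", "Ala"], "X5": ["Pro", "Gln"], "X6": ["Gln", "Glu"],
    "X7": ["Leu", "Thr"], "X8": ["Val", "Tyr"], "X9": ["Thr", "Ser"],
    "X10": ["Leu", "Ile"], "X11": ["Met", "Val"], "X12": ["Ile", "Val"],
}

# Residues permitted at each position in the more expansive library.
EXPANSIVE = {
    "X1": ["Gly", "Ala", "Val", "Ile", "Leu"],
    "X2": ["Phe", "Tyr", "Thr", "Ser"],
    "X3": ["Phe", "Tyr", "Gly", "Ala", "Val", "Ile", "Leu"],
    "X4": ["Gly", "Ala", "Val", "Ile", "Leu"],
    "X5": ["Pro", "Asn", "Gln"],
    "X6": ["Asn", "Gln", "Asp", "Glu"],
    "X7": ["Gly", "Ala", "Val", "Ile", "Leu", "Ser", "Thr"],
    "X8": ["Gly", "Ala", "Val", "Ile", "Leu", "Phe", "Tyr"],
    "X9": ["Thr", "Cys", "Ser"],
    "X10": ["Gly", "Ala", "Val", "Ile", "Leu"],
    "X11": ["Met", "Cys", "Gly", "Ala", "Val", "Ile", "Leu", "Ser", "Thr"],
    "X12": ["Gly", "Ala", "Val", "Ile", "Leu"],
}


def library_size(positions):
    """Number of distinct sequences if every position varies independently."""
    return prod(len(set(options)) for options in positions.values())


restricted_size = library_size(RESTRICTED)   # two choices at each of 12 positions
expansive_size = library_size(EXPANSIVE)
fully_degenerate_size = 20 ** len(RESTRICTED)  # any of 20 residues per position
```

The restricted library is small enough to screen exhaustively, whereas the fully degenerate library is far larger than the number of clones that a typical display library contains, which is one reason to restrict the residues allowed at each position.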

There are many ways by which such libraries of potential HDx homologs can be 
generated from a degenerate oligonucleotide sequence. Chemical synthesis of a 
degenerate gene sequence can be carried out in an automatic DNA synthesizer, and the 
synthetic genes then ligated into an appropriate expression vector. The purpose of a 
degenerate set of genes is to provide, in one mixture, all of the sequences encoding the 
desired set of potential HDx sequences. The synthesis of degenerate oligonucleotides is 
well known in the art (see for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et 
al. (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG 
Walton, Amsterdam: Elsevier pp273-289; Itakura et al. (1984) Annu. Rev. Biochem. 
53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 
11:477). Such techniques have been employed in the directed evolution of other proteins 
(see, for example, Scott et al. (1990) Science 249:386-390; Roberts et al. (1992) PNAS 
89:2429-2433; Devlin et al. (1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 
6378-6382; as well as U.S. Patents Nos. 5,223,409, 5,198,346, and 5,096,815). 
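
The notion of a degenerate oligonucleotide providing "in one mixture" all of the desired coding sequences can be illustrated by expanding a sequence written in the standard IUPAC nucleotide ambiguity codes. This is a sketch only; the function name is hypothetical, and the choice of the degenerate codon RTT (which covers ATT, isoleucine, and GTT, valine, the two residues permitted at X1) is an assumption made for illustration rather than a codon given in the specification.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(oligo):
    """List every unambiguous sequence encoded by a degenerate oligonucleotide."""
    pools = [IUPAC[base] for base in oligo.upper()]
    return ["".join(combination) for combination in product(*pools)]


# A codon degenerate at its first base: purine (A or G) followed by TT.
x1_codons = expand_degenerate("RTT")
# A fully randomized codon restricted to G or T at the third position.
nnk_codons = expand_degenerate("NNK")
```

A synthesizer programmed with mixed bases at the ambiguous positions yields the whole set in a single synthesis, which is then ligated into the expression vector as described above.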

Likewise, a library of coding sequence fragments can be provided for an HDx 
clone in order to generate a variegated population of HDx fragments for screening and 
subsequent selection of bioactive fragments. A variety of techniques are known in the art 
for generating such libraries, including chemical synthesis. In one embodiment, a library 
of coding sequence fragments can be generated by (i) treating a double stranded PCR 
fragment of an HDx coding sequence with a nuclease under conditions wherein nicking 
occurs only about once per molecule; (ii) denaturing the double stranded DNA; (iii) 
renaturing the DNA to form double stranded DNA which can include sense/antisense 
pairs from different nicked products; (iv) removing single stranded portions from 
reformed duplexes by treatment with S1 nuclease; and (v) ligating the resulting fragment 
library into an expression vector. By this exemplary method, an expression library can be 
derived which codes for N-terminal, C-terminal and internal fragments of various sizes. 

A wide range of techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a certain property. Such techniques will be generally 
adaptable for rapid screening of the gene libraries generated by the combinatorial 
mutagenesis of HDx homologs. The most widely used techniques for screening large 
gene libraries typically comprise cloning the gene library into replicable expression 
vectors, transforming appropriate cells with the resulting library of vectors, and 
expressing the combinatorial genes under conditions in which detection of a desired 
activity facilitates relatively easy isolation of the vector encoding the gene whose product 
was detected. 

In an exemplary embodiment, the library of HDx variants is expressed as a fusion 
protein on the surface of a viral particle. For instance, in the filamentous phage system, 
foreign peptide sequences can be expressed on the surface of infectious phage, thereby 
conferring two significant benefits. First, since these phage can be applied to affinity 
matrices at very high concentrations, a large number of phage can be screened at one 
time. Second, since each infectious phage displays the combinatorial gene product on its 
surface, if a particular phage is recovered from an affinity matrix in low yield, the phage 
can be amplified by another round of infection. The group of almost identical E. coli 
filamentous phages M13, fd, and f1 are most often used in phage display libraries, as 
either of the phage gIII or gVIII coat proteins can be used to generate fusion proteins 
without disrupting the ultimate packaging of the viral particle (Ladner et al. PCT 
publication WO 90/02909; Garrard et al., PCT publication WO 92/09690; Marks et al. 
(1992) J. Biol. Chem. 267:16007-16010; Griffiths et al. (1993) EMBO J 12:725-734; 
Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) PNAS 
89:4457-4461). 

For example, the recombinant phage antibody system (RPAS, Pharmacia Catalog 
number 27-9400-01) can be easily modified for use in expressing and screening HDx 
combinatorial libraries by panning on glutathione immobilized histones/GST fusion 
proteins or RbAp48/GST fusion protein to enrich for HDx homologs which retain an 
ability to bind a substrate or regulatory protein. Each of these HDx homologs can 
subsequently be screened for further biological activities in order to differentiate agonists 
and antagonists. For example, histone-binding homologs isolated from the combinatorial 
library can be tested for their enzymatic activity directly, or for their effect on cellular 
proliferation relative to the wild-type form of the protein. 

The invention also provides for reduction of the HDx or RbAp48 or histone 
proteins to generate mimetics, e.g. peptide or non-peptide agents, which are able to 
disrupt a biological activity of an HDx polypeptide of the present invention, e.g. as a 
catalytic inhibitor or an inhibitor of protein-protein interactions. Thus, such mutagenic 
techniques as described above are also useful to map the determinants of the HDx 
proteins which participate in protein-protein or protein-DNA interactions involved in, for 
example, interaction of the subject HDx polypeptide with histones, RbAp48 or 
cytoskeletal elements. To illustrate, the critical residues of a subject HDx polypeptide 
which are involved in molecular recognition of histones can be determined and used to 
generate HDx-derived peptidomimetics which competitively inhibit binding of the 
authentic HDx protein with that moiety. Likewise, residues of a histone or of RbAp48 
involved in binding to HDx proteins can be identified, and peptides or peptidomimetics 
based on such residues can also be used as competitive inhibitors of the interaction of an 
HDx protein with either of those proteins. By employing, for example, scanning 
mutagenesis to map the amino acid residues of a protein which are involved in binding 
other proteins, peptidomimetic compounds can be generated which mimic those residues 
which facilitate the interaction. Such mimetics may then be used to interfere with the 
normal function of an HDx protein. For instance, non-hydrolyzable peptide analogs of 
such residues can be generated using benzodiazepine (e.g., see Freidinger et al. in 
Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, 
Netherlands, 1988), azepine (e.g., see Huffman et al. in Peptides: Chemistry and Biology, 
G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted gamma 
lactam rings (Garvey et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., 
ESCOM Publisher: Leiden, Netherlands, 1988), keto-methylene pseudopeptides 
(Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et al. in Peptides: Structure 
and Function (Proceedings of the 9th American Peptide Symposium) Pierce Chemical 
Co. Rockland, IL, 1985), β-turn dipeptide cores (Nagai et al. Tetrahedron Lett 
26:647; and Sato et al. (1986) J Chem Soc Perkin Trans 1:1231), and β-aminoalcohols 
(Gordon et al. (1985) Biochem Biophys Res Commun 126:419; and Dann et al. (1986) 
Biochem Biophys Res Commun 134:71). 

Another aspect of the invention pertains to an antibody specifically reactive with 
an HDx protein. For example, by using immunogens derived from an HDx protein, e.g. 
based on the cDNA sequences, anti-protein/anti-peptide antisera or monoclonal 
antibodies can be made by standard protocols (See, for example, Antibodies: A 
Laboratory Manual ed. by Harlow and Lane (Cold Spring Harbor Press: 1988)). A 
mammal, such as a mouse, a hamster or rabbit can be immunized with an immunogenic 
form of the peptide (e.g., an HDx polypeptide or an antigenic fragment which is capable 
of eliciting an antibody response). Techniques for conferring immunogenicity on a 
protein or peptide include conjugation to carriers or other techniques well known in the 
art. An immunogenic portion of an HDx protein can be administered in the presence of 
adjuvant. The progress of immunization can be monitored by detection of antibody titers 
in plasma or serum. Standard ELISA or other immunoassays can be used with the 
immunogen as antigen to assess the levels of antibodies. In a preferred embodiment, the 
subject antibodies are immunospecific for antigenic determinants of an HDx protein of an 
organism, such as a mammal, e.g. antigenic determinants of a protein represented by one 
of SEQ ID Nos: 5-8 or closely related homologs (e.g. at least 85% homologous, 
preferably at least 90% homologous, and more preferably at least 95% homologous). In 
yet a further preferred embodiment of the present invention, in order to provide, for 
example, antibodies which are immuno-selective for discrete HDx homologs, e.g. HD1, 
the anti-HDx polypeptide antibodies do not substantially cross react (i.e. do not react 
specifically) with a protein which is, for example, less than 85%, 90% or 95% 
homologous with the selected HDx. By "not substantially cross react", it is meant that 
the antibody has a binding affinity for a non-homologous protein which is at least one 
order of magnitude, more preferably at least 2 orders of magnitude, and even more 
preferably at least 3 orders of magnitude less than the binding affinity of the antibody for 
the intended target HDx. 
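
The "orders of magnitude" criterion for cross-reactivity can be made concrete with a short calculation. The sketch below assumes that binding affinity is expressed as an association constant (larger values mean tighter binding); that choice, the function name and the numerical values are hypothetical and serve only to illustrate the three tiers recited above.

```python
def cross_reactivity_tier(target_affinity, off_target_affinity):
    """Return how many of the recited thresholds (1, 2 or 3 orders of
    magnitude) separate the off-target affinity from the target affinity.

    Affinities are assumed to be association constants in the same units.
    A return value of 0 means the antibody substantially cross reacts.
    """
    if target_affinity <= 0 or off_target_affinity <= 0:
        raise ValueError("affinities must be positive")

    fold_difference = target_affinity / off_target_affinity
    if fold_difference >= 1000:
        return 3  # at least 3 orders of magnitude weaker (most preferred)
    if fold_difference >= 100:
        return 2  # at least 2 orders of magnitude weaker
    if fold_difference >= 10:
        return 1  # at least one order of magnitude weaker
    return 0


# Hypothetical affinities for the intended HDx target and two other proteins.
tier_for_distant_protein = cross_reactivity_tier(1e9, 1e6)
tier_for_close_homolog = cross_reactivity_tier(1e9, 5e8)
```

If affinities are instead reported as dissociation constants, the ratio is inverted, since a larger dissociation constant corresponds to weaker binding.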

Following immunization of an animal with an antigenic preparation of an HDx 
polypeptide, anti-HDx antisera can be obtained and, if desired, polyclonal anti-HDx 
antibodies isolated from the serum. To produce monoclonal antibodies, antibody-producing 
cells (lymphocytes) can be harvested from an immunized animal and fused by 
standard somatic cell fusion procedures with immortalizing cells such as myeloma cells to 
yield hybridoma cells. Such techniques are well known in the art, and include, for 
example, the hybridoma technique (originally developed by Kohler and Milstein, (1975) 
Nature, 256: 495-497), the human B cell hybridoma technique (Kozbor et al., (1983) 
Immunology Today, 4: 72), and the EBV-hybridoma technique to produce human 
monoclonal antibodies (Cole et al., (1985) Monoclonal Antibodies and Cancer Therapy, 
Alan R. Liss, Inc. pp. 77-96). Hybridoma cells can be screened immunochemically for 
production of antibodies specifically reactive with an HDx polypeptide of the present 
invention and monoclonal antibodies isolated from a culture comprising such hybridoma 
cells. 

The term antibody as used herein is intended to include fragments thereof which 
are also specifically reactive with one of the subject HDx polypeptides. Antibodies can be 
fragmented using conventional techniques and the fragments screened for utility in the 
same manner as described above for whole antibodies. For example, F(ab)2 fragments 
can be generated by treating antibody with pepsin. The resulting F(ab)2 fragment can be 
treated to reduce disulfide bridges to produce Fab fragments. The antibody of the 
present invention is further intended to include bispecific and chimeric molecules having 
affinity for an HDx protein conferred by at least one CDR region of the antibody. 

Both monoclonal and polyclonal antibodies (Ab) directed against authentic HDx 
polypeptides, or HDx variants, and antibody fragments such as Fab, F(ab)2, Fv and scFv 
can be used to block the action of one or more HDx proteins and allow the study of the 
role of these proteins in, for example, differentiation of tissue. Experiments of this nature 
can aid in deciphering the role of HDx proteins that may be involved in control of 
proliferation versus differentiation, e.g., in patterning and tissue formation. 

Antibodies which specifically bind HDx epitopes can also be used in 
immunohistochemical staining of tissue samples in order to evaluate the abundance and 
pattern of expression of each of the subject HDx polypeptides. Anti-HDx antibodies can 
be used diagnostically in immuno-precipitation and immuno-blotting to detect and 
evaluate HDx protein levels in tissue as part of a clinical testing procedure. For instance, 
such measurements can be useful in predictive valuations of the onset or progression of 
proliferative or differentiative disorders. Likewise, the ability to monitor HDx protein 
levels in an individual can allow determination of the efficacy of a given treatment 
regimen for an individual afflicted with such a disorder. The level of HDx polypeptides 
may be measured from cells in bodily fluid, such as in samples of cerebral spinal fluid or 
amniotic fluid, or can be measured in tissue, such as produced by biopsy. Diagnostic 
assays using anti-HDx antibodies can include, for example, immunoassays designed to aid 
in early diagnosis of a disorder, particularly ones which are manifest at birth. Diagnostic 
assays using anti-HDx polypeptide antibodies can also include immunoassays designed to 
aid in early diagnosis and phenotyping of neoplastic or hyperplastic disorders. 

Another application of anti-HDx antibodies of the present invention is in the 
immunological screening of cDNA libraries constructed in expression vectors such as 
λgt11, λgt18-23, λZAP, and λORF8. Messenger libraries of this type, having coding 
sequences inserted in the correct reading frame and orientation, can produce fusion 
proteins. For instance, λgt11 will produce fusion proteins whose amino termini consist 
of β-galactosidase amino acid sequences and whose carboxy termini consist of a foreign 
polypeptide. Antigenic epitopes of an HDx protein, e.g. other orthologs of a particular 
HDx protein or other paralogs from the same species, can then be detected with 
antibodies, as, for example, reacting nitrocellulose filters lifted from infected plates with 
anti-HDx antibodies. Positive phage detected by this assay can then be isolated from the 
infected plate. Thus, the presence of HDx homologs can be detected and cloned from 
other animals, as can alternate isoforms (including splicing variants) from humans. 

Moreover, the nucleotide sequences determined from the cloning of HDx genes 
from organisms will further allow for the generation of probes and primers designed for 
use in identifying and/or cloning HDx homologs in other cell types, e.g. from other 
tissues, as well as homologs from other organisms. For instance, the present 
invention also provides a probe/primer comprising a substantially purified 
oligonucleotide, which oligonucleotide comprises a region of nucleotide sequence that 
hybridizes under stringent conditions to at least 10 consecutive nucleotides of sense or 
antisense sequence selected from the group consisting of SEQ ID Nos: [...] or naturally 
occurring mutants thereof. For instance, primers based on the nucleic acid represented in 
SEQ ID Nos: 1-4 can be used in PCR reactions to clone HDx homologs. Likewise, 
probes based on the subject HDx sequences can be used to detect transcripts or genomic 
sequences encoding the same or homologous proteins. In preferred embodiments, the 
probe further comprises a label group attached thereto and able to be detected, e.g. the 
label group is selected from amongst radioisotopes, fluorescent compounds, enzymes, 
and enzyme co-factors. 

Such probes can also be used as a part of a diagnostic test kit for identifying cells 
or tissue which misexpress an HDx protein, such as by measuring a level of [...] 
levels or determining whether a genomic HDx gene has been mutated or deleted. 

To illustrate, nucleotide probes can be generated from the subject HDx genes 
which facilitate histological screening of intact tissue and tissue samples for the presence 
(or absence) of HDx-encoding transcripts. Similar to the diagnostic uses of anti-HDx 
antibodies, the use of probes directed to HDx messages, or to genomic HDx sequences, 
can be used for both predictive and therapeutic evaluation of allelic mutations which 
might be manifest in, for example, neoplastic or hyperplastic disorders (e.g. unwanted cell 
growth) or abnormal differentiation of tissue. Used in conjunction with immunoassays as 
described above, the oligonucleotide probes can help facilitate the determination of the 
molecular basis for a developmental disorder which may involve some abnormality 
associated with expression (or lack thereof) of an HDx protein. For instance, variation in 
polypeptide synthesis can be differentiated from a mutation in a coding sequence. 

Accordingly, the present invention provides a method for determining if a subject is 
at risk for a disorder characterized by aberrant cell proliferation and/or differentiation. In 
preferred embodiments, the method can be generally characterized as comprising detecting, 
in a sample of cells from the subject, the presence or absence of a genetic lesion 
characterized by at least one of (i) an alteration affecting the integrity of a gene encoding 
an HDx-protein, or (ii) the mis-expression of the HDx gene. To illustrate, such genetic 
lesions can be detected by ascertaining the existence of at least one of (i) a deletion of 
one or more nucleotides from an HDx gene, (ii) an addition of one or more nucleotides to 
an HDx gene, (iii) a substitution of one or more nucleotides of an HDx gene, (iv) a gross 
chromosomal rearrangement of an HDx gene, (v) a gross alteration in the level of a 
messenger RNA transcript of an HDx gene, (vi) aberrant modification of an HDx gene, 
such as of the methylation pattern of the genomic DNA, (vii) the presence of a non-wild 
type splicing pattern of a messenger RNA transcript of an HDx gene, (viii) a non-wild 
type level of an HDx-protein, and (ix) inappropriate post-translational modification of an 
HDx-protein. As set out below, the present invention provides a large number of assay 
techniques for detecting lesions in an HDx gene, and importantly, provides the ability to 
discern between different molecular causes underlying HDx-dependent aberrant cell 
growth, proliferation and/or differentiation. 

In an exemplary embodiment, there is provided a nucleic acid composition 
comprising a (purified) oligonucleotide probe including a region of nucleotide sequence 
which is capable of hybridizing to a sense or antisense sequence of an HDx gene, such as 
represented by any of SEQ ID Nos: 1-4, or naturally occurring mutants thereof, or 5' or 
3' flanking sequences or intronic sequences naturally associated with the subject HDx 
genes or naturally occurring mutants thereof. The nucleic acid of a cell is rendered 
accessible for hybridization, the probe is exposed to nucleic acid of the sample, and the 
hybridization of the probe to the sample nucleic acid is detected. Such techniques can be 
used to detect lesions at either the genomic or mRNA level, including deletions, 
substitutions, etc., as well as to determine mRNA transcript levels. 

In certain embodiments, detection of the lesion comprises utilizing the
probe/primer in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195
and 4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain
reaction (LCR) (see, e.g., Landegren et al. (1988) Science 241:1077-1080; and
Nakazawa et al. (1994) PNAS 91:360-364), the latter of which can be particularly useful
for detecting point mutations in the HDx gene. In a merely illustrative embodiment, the
method includes the steps of (i) collecting a sample of cells from a patient, (ii) isolating
nucleic acid (e.g., genomic, mRNA or both) from the cells of the sample, (iii) contacting
the nucleic acid sample with one or more primers which specifically hybridize to an HDx
gene under conditions such that hybridization and amplification of the HDx gene (if
present) occurs, and (iv) detecting the presence or absence of an amplification product,
or detecting the size of the amplification product and comparing the length to a control
sample.

In still another embodiment, the level of an HDx-protein can be detected by
immunoassay. For instance, the cells of a biopsy sample can be lysed, and the level of an
HDx-protein present in the cell can be quantitated by standard immunoassay techniques.
In yet another exemplary embodiment, aberrant methylation patterns of an HDx gene can
be detected by digesting genomic DNA from a patient sample with one or more
restriction endonucleases that are sensitive to methylation and for which recognition sites
exist in the HDx gene (including in the flanking and intronic sequences). See, for
example, Buiting et al. (1994) Human Mol Genet 3:893-895. Digested DNA is separated
by gel electrophoresis, and hybridized with probes derived from, for example, genomic or
cDNA sequences. The methylation status of the HDx gene can be determined by
comparison of the restriction pattern generated from the sample DNA with that for a
standard of known methylation.
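
Merely by way of illustration, the comparison of restriction patterns described above can
be sketched in Python as follows. The sketch assumes, as an example not taken from the
present description, the enzyme HpaII, which recognizes CCGG, cuts after the first C, and
is blocked when the site is methylated. The function name, the toy sequence, and the
methylated position are hypothetical.

```python
def digest_fragments(seq, methylated_sites=(), site="CCGG", cut_offset=1):
    """Return fragment lengths after digestion with a methylation-sensitive enzyme.

    methylated_sites holds the 0-based start positions of recognition sites that
    are methylated and therefore protected from cleavage (illustrative model).
    """
    seq = seq.upper()
    blocked = set(methylated_sites)
    cuts = []
    start = seq.find(site)
    while start != -1:
        if start not in blocked:          # only unmethylated sites are cut
            cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [right - left for left, right in zip(bounds, bounds[1:])]


# Hypothetical 20-nucleotide region with recognition sites at positions 4 and 14.
region = "AAAACCGGTTTTTTCCGGAA"
standard_pattern = digest_fragments(region)                       # [5, 10, 5]
sample_pattern = digest_fragments(region, methylated_sites=[14])  # [5, 15]
pattern_differs = sample_pattern != standard_pattern
```

A sample pattern that differs from the pattern of the standard, as in this toy case,
would indicate a difference in methylation status at one or more recognition sites.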

In yet another aspect of the invention, the subject HDx polypeptides can be used
to generate a "two hybrid" assay or an "interaction trap" assay (see, for example, U.S.
Patent No. 5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J Biol
Chem 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; Iwabuchi et al.
(1993) Oncogene 8:1693-1696; and Brent WO94/10300), for isolating coding sequences
for other cellular proteins which bind HDxs ("HDx-binding proteins" or "HDx-bp").
Such HDx-binding proteins would likely be involved in the regulation of HDx, e.g., as
regulatory subunits or transducers, or be substrates which are regulated by an HDx.

Briefly, the interaction trap relies on reconstituting in vivo a functional
transcriptional activator protein from two separate fusion proteins. In particular, the
method makes use of chimeric genes which express hybrid proteins. To illustrate, a first
hybrid gene comprises the coding sequence for a DNA-binding domain of a
transcriptional activator fused in frame to the coding sequence for an HDx polypeptide.
The second hybrid gene encodes a transcriptional activation domain fused in frame to a
sample gene from a cDNA library. If the bait and sample hybrid proteins are able to
interact, e.g., form an HDx-dependent complex, they bring into close proximity the two
domains of the transcriptional activator. This proximity is sufficient to cause
transcription of a reporter gene which is operably linked to a transcriptional regulatory
site responsive to the transcriptional activator, and expression of the reporter gene can be
detected and used to score for the interaction of the HDx and sample proteins.

Furthermore, by making available purified and recombinant HDx polypeptides,
the present invention facilitates the development of assays which can be used to screen
for drugs, including HDx homologs, which are either agonists or antagonists of the
normal cellular function of the subject HDx polypeptides, or of their role in the
pathogenesis of cellular differentiation and/or proliferation and disorders related thereto.
Moreover, because we have also identified HDx-related proteins, such as the yeast RPD3
proteins, as histone deacetylases, the present invention further provides drug screening
assays for detecting agents which modulate the bioactivity of HDx-related proteins. Such
agents, when directed to, for example, fungal HDx-related proteins, can be used in the
treatment of various infections. In a general sense, the assay evaluates the ability of a
compound to modulate binding between an HDx polypeptide and a molecule, be it
protein or DNA, that interacts with the HDx polypeptide. It will be apparent from the
following description of exemplary assays that, in place of a human (or other mammalian)
HDx protein, the assay can be derived with an HDx-related protein such as RPD3.
Likewise, in place of human RbAp48 or Sin3A, other HDx-binding proteins can be used,
e.g., other human proteins. Exemplary compounds which can be screened include
peptides, nucleic acids, carbohydrates, small organic molecules, and natural product
extract libraries, such as isolated from animals, plants, fungus and/or microbes.

It is contemplated that any of the novel interactions described herein could be
exploited in a drug screening assay. For example, in one embodiment, the interaction
between an HDx protein and RbAp48 can be detected in the presence and the absence of
a test compound. In another embodiment, the ability of a compound to modulate the
binding of an HDx protein, or HDx-related protein such as the yeast RPD3, with histones
can be assessed. The identification of a test compound which influences, for example,
HD1 catalyzed deacetylation of histones would be useful in the modulation of HD1
activity in mammalian cells, while the identification of a test compound which selectively
inhibits the yeast RPD3 deacetylase activity would be useful as an antifungal agent. In
other embodiments the effect of a test compound on the binding of an HDx protein to
other molecules, such as cytoskeletal components, or other proteins identified by the
HDx-dependent ITS set out above, could be tested. A variety of assay formats will
suffice and, in light of the present inventions, will be comprehended by a skilled artisan.

In a preferred embodiment, assays which employ the subject mammalian HDx
proteins can be used to identify compounds that have therapeutic indexes more favorable
than sodium butyrate, trapoxin, trichostatin or the like. For instance, trapoxin-like drugs
can be identified by the present invention which have enhanced tissue-type or cell-type
specificity relative to trapoxin. To illustrate, the subject assays can be used to generate
compounds which preferentially inhibit IL-2 mediated proliferation/activation of
lymphocytes, or inhibit proliferation of certain tumor cells, without substantially
interfering with other tissues, e.g. hepatocytes. Likewise, similar assays can be used to
identify drugs which inhibit proliferation of yeast cells or other lower eukaryotes but
which have a substantially reduced effect on mammalian cells, thereby improving the
therapeutic index of the drug as an anti-mycotic agent.

In one embodiment, the identification of such compounds is made possible by the
use of differential screening assays which detect and compare drug-mediated inhibition of
deacetylase activity between two or more different HDx-like enzymes, or compare
drug-mediated inhibition of formation of complexes involving two or more different types
of HDx-like proteins. To illustrate, the assay can be designed for side-by-side comparison
of the effect of a test compound on the deacetylase activity or protein interactions of
tissue-type specific HDx proteins. Given the apparent diversity of HDx proteins, it is
probable that different functional HDx activities, or HDx complexes, exist and, in certain
instances, are localized to particular tissue or cell types. Thus, test compounds can be
screened for agents able to inhibit the tissue-specific formation of only a subset of the
possible repertoire of HDx/regulatory protein complexes, or which preferentially inhibit
certain HDx enzymes. In an exemplary embodiment, an interaction trap assay can be
derived using two or more different human HDx "bait" proteins, while the "fish" protein
is constant in each, e.g. a human RbAp48 construct. Running the interaction trap
side-by-side permits the detection of agents which have a greater effect (e.g. statistically
significant) on the formation of one of the HDx/RbAp48 complexes than on the formation
of the other HDx complexes.

In similar fashion, differential screening assays can be used to exploit the
difference in protein interactions and/or catalytic mechanism of mammalian HDx proteins
and yeast RPD3 proteins in order to identify agents which display a statistically significant
increase in specificity for inhibiting the yeast enzyme relative to the mammalian enzyme.
Thus, lead compounds which act specifically on pathogens, such as fungus involved in
mycotic infections, can be developed. By way of illustration, the present assays can be
used to screen for agents which may ultimately be useful for inhibiting at least one fungus
implicated in such mycosis as candidiasis, aspergillosis, mucormycosis, blastomycosis,
geotrichosis, cryptococcosis, chromoblastomycosis, coccidioidomycosis, conidiosporosis,
histoplasmosis, maduromycosis, rhinosporidiosis, nocardiosis, para-actinomycosis,
penicilliosis, moniliasis, or sporotrichosis. For example, if the mycotic infection to which
treatment is desired is candidiasis, the present assay can comprise comparing the relative
effectiveness of a test compound on inhibiting the deacetylase activity of a mammalian
HDx protein with its effectiveness towards inhibiting the deacetylase activity of an RPD3
homolog cloned from yeast selected from the group consisting of Candida albicans,
Candida stellatoidea, Candida tropicalis, Candida parapsilosis, Candida krusei,
Candida pseudotropicalis, Candida guilliermondii, or Candida rugosa. Likewise, the
present assay can be used to identify anti-fungal agents which may have therapeutic value
in the treatment of aspergillosis by selectively targeting RPD3 homologs cloned from
yeast such as Aspergillus fumigatus, Aspergillus flavus, Aspergillus niger,
Aspergillus nidulans, or Aspergillus terreus. Where the mycotic infection is
mucormycosis, the RPD3 deacetylase can be derived from yeast such as Rhizopus
arrhizus, Rhizopus oryzae, Absidia corymbifera, Absidia ramosa, or Mucor pusillus.
Sources of other RPD3 activities for comparison with a mammalian HDx activity include
the pathogen Pneumocystis carinii.

In addition to such HDx therapeutic uses, anti-fungal agents developed with such
differential screening assays can be used, for example, as preservatives in foodstuff, feed
supplement for promoting weight gain in livestock, or in disinfectant formulations for
treatment of non-living matter, e.g., for decontaminating hospital equipment and rooms.

In similar fashion, side by side comparison of inhibition of a mammalian HDx
protein and an insect HDx-related protein will permit selection of HDx inhibitors which
discriminate between the human/mammalian and insect enzymes. Accordingly, the
present invention expressly contemplates the use and formulations of the subject HDx
therapeutics in insecticides, such as for use in management of insects like the fruit fly.

In yet another embodiment, certain of the subject HDx inhibitors can be selected
on the basis of inhibitory specificity for plant HDx-related activities relative to the
mammalian enzyme. For example, a plant HDx-related protein can be disposed in a
differential screen with one or more of the human enzymes to select those compounds of
greatest selectivity for inhibiting the plant enzyme. Thus, the present invention
specifically contemplates formulations of the subject HDx inhibitors for agricultural
applications, such as in the form of a defoliant or the like.

In many drug screening programs which test libraries of compounds and natural
extracts, high throughput assays are desirable in order to maximize the number of
compounds surveyed in a given period of time. Assays which are performed in cell-free
systems, such as may be derived with purified or semi-purified proteins, are often
preferred as "primary" screens in that they can be generated to permit rapid development
and relatively easy detection of an alteration in a molecular target which is mediated by a
test compound. Moreover, the effects of cellular toxicity and/or bioavailability of the test
compound can be generally ignored in the in vitro system, the assay instead being
focused primarily on the effect of the drug on the molecular target as may be manifest in
an alteration of binding affinity with upstream or downstream elements. Accordingly, in
an exemplary screening assay of the present invention, a reaction mixture is generated to
include an HDx polypeptide, compound(s) of interest, and a "target polypeptide", e.g., a
protein, which interacts with the HDx polypeptide, whether as a substrate or by some
other protein-protein interaction. Exemplary target polypeptides include histones,
RbAp48 polypeptides, Sin3 polypeptides, and/or combinations thereof or with other
transcriptional regulatory proteins (such as myc, max, etc.; see Example 3). Detection
and quantification of complexes containing the HDx protein provide a means for
determining a compound's efficacy at inhibiting (or potentiating) complex formation
between the HDx and the target polypeptide. The efficacy of the compound can be
assessed by generating dose response curves from data obtained using various
concentrations of the test compound. Moreover, a control assay can also be performed
to provide a baseline for comparison. In the control assay, isolated and purified HDx
polypeptide is added to a composition containing the target polypeptide and the
formation of a complex is quantitated in the absence of the test compound.
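
Merely to illustrate how a dose response curve of this kind might be summarized, the
following Python sketch expresses complex formation as a percentage of the control assay
and estimates the concentration giving 50% inhibition (IC50) by interpolation on a
logarithmic concentration scale. The function names, the interpolation method, and all
numerical values are assumptions made for illustration and are not taken from the
present description.

```python
import math


def percent_of_control(signal, control_signal):
    """Express complex formation relative to the control assay (no test compound)."""
    return 100.0 * signal / control_signal


def estimate_ic50(concentrations, responses):
    """Estimate the IC50 by log-linear interpolation between bracketing points.

    concentrations: ascending, positive test compound concentrations.
    responses: complex formation in percent of control at each concentration.
    Returns None when the responses never cross 50 percent.
    """
    points = list(zip(concentrations, responses))
    for (c_lo, r_lo), (c_hi, r_hi) in zip(points, points[1:]):
        if r_lo >= 50.0 >= r_hi and r_lo != r_hi:
            fraction = (r_lo - 50.0) / (r_lo - r_hi)
            log_ic50 = math.log10(c_lo) + fraction * (
                math.log10(c_hi) - math.log10(c_lo)
            )
            return 10 ** log_ic50
    return None


# Hypothetical data: concentrations in nM and raw complex signal (arbitrary units).
control = 2000.0
doses = [1.0, 10.0, 100.0, 1000.0]
signals = [1800.0, 1400.0, 600.0, 200.0]
curve = [percent_of_control(s, control) for s in signals]  # 90, 70, 30, 10 percent
ic50 = estimate_ic50(doses, curve)                         # about 31.6 nM
```

In practice a fitted sigmoidal model would usually replace the simple interpolation
shown here; the sketch is intended only to make the role of the control assay explicit.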

Complex formation between the HDx polypeptide and the target polypeptide may
be detected by a variety of techniques. Modulation of the formation of complexes can be
quantitated using, for example, detectably labeled proteins such as radiolabeled,
fluorescently labeled, or enzymatically labeled HDx polypeptides, by immunoassay, by
chromatographic detection, or by detecting the intrinsic activity of the deacetylase.

Typically, it will be desirable to immobilize either HDx or the target polypeptide
to facilitate separation of complexes from uncomplexed forms of one or both of the
proteins, as well as to accommodate automation of the assay. Binding of HDx to the
target polypeptide, in the presence and absence of a candidate agent, can be
accomplished in any vessel suitable for containing the reactants. Examples include
microtitre plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion
protein can be provided which adds a domain that allows the protein to be bound to a
matrix. For example, glutathione-S-transferase/HDx (GST/HDx) fusion proteins can be
adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or
glutathione derivatized microtitre plates, which are then combined with the cell lysates,
e.g. 35S-labeled lysates, and the test compound, and the mixture incubated under conditions
conducive to complex formation, e.g. at physiological conditions for salt and pH, though
slightly more stringent conditions may be desired. Following incubation, the beads are
washed to remove any unbound label, and the matrix immobilized and radiolabel
determined directly (e.g. beads placed in scintillant), or in the supernatant after the
complexes are subsequently dissociated. Alternatively, the complexes can be dissociated
from the matrix, separated by SDS-PAGE, and the level of HDx-binding protein found in
the bead fraction quantitated from the gel using standard electrophoretic techniques such
as described in the appended examples.

Other techniques for immobilizing proteins on matrices are also available for use
in the subject assay. For instance, either HDx or target polypeptide can be immobilized
utilizing conjugation of biotin and streptavidin. For instance, biotinylated HDx molecules
can be prepared from biotin-NHS (N-hydroxy-succinimide) using techniques well known
in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), and immobilized in the
wells of streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, antibodies
reactive with HDx, but which do not interfere with the interaction between the HDx and
target polypeptide, can be derivatized to the wells of the plate, and HDx trapped in the
wells by antibody conjugation. As above, preparations of a target polypeptide and a test
compound are incubated in the HDx-presenting wells of the plate, and the amount of
complex trapped in the well can be quantitated. Exemplary methods for detecting such
complexes, in addition to those described above for the GST-immobilized complexes,
include immunodetection of complexes using antibodies reactive with the target
polypeptide, or which are reactive with HDx protein and compete with the target
polypeptide; as well as enzyme-linked assays which rely on detecting an enzymatic
activity associated with the target polypeptide, either intrinsic or extrinsic activity. In the
instance of the latter, the enzyme can be chemically conjugated or provided as a fusion
protein with the target polypeptide. To illustrate, the target polypeptide can be
chemically cross-linked or genetically fused with horseradish peroxidase, and the amount
of polypeptide trapped in the complex can be assessed with a chromogenic substrate of
the enzyme, e.g. 3,3'-diaminobenzidine tetrahydrochloride or 4-chloro-1-naphthol.
Likewise, a fusion protein comprising the polypeptide and glutathione-S-transferase can
be provided, and complex formation quantitated by detecting the GST activity using
1-chloro-2,4-dinitrobenzene (Habig et al. (1974) J Biol Chem 249:7130).

For processes which rely on immunodetection for quantitating one of the proteins
trapped in the complex, antibodies against the protein, such as anti-HDx antibodies, can
be used. Alternatively, the protein to be detected in the complex can be "epitope tagged"
in the form of a fusion protein which includes, in addition to the HDx sequence, a second
polypeptide for which antibodies are readily available (e.g. from commercial sources).
For instance, the GST fusion proteins described above can also be used for quantification
of binding using antibodies against the GST moiety. Other useful epitope tags include
myc-epitopes (e.g., see Ellison et al. (1991) J Biol Chem 266:21150-21157) which
includes a 10-residue sequence from c-myc, as well as the pFLAG system (International
Biotechnologies, Inc.) or the pEZZ-protein A system (Pharmacia, NJ).

In another embodiment of a drug screening assay, a two hybrid assay can be generated
with an HDx and HDx-binding protein. Drug dependent inhibition or potentiation of the
interaction can be scored.

Where the HDx proteins themselves, or in complexes with other proteins, are
capable of binding DNA and modifying transcription of a gene, a transcription-based
assay can be used, for example, one employing transcriptional regulatory sequences
responsive to HDx complexes operably linked to a detectable marker gene. For
illustration, see Example 3.

To test the effect of a histone deacetylase inhibitor on MadN35GALVP16 and
Mad(Pro)N35GALVP16 mediated repression, we treated a duplicate set of transfections
with 10 nM trapoxin for eight hours prior to harvest. In the representative experiment
shown, 10 nM trapoxin treatment derepressed the activity of MadN35GALVP16
nine-fold while it had little effect on the activity of Mad(Pro)N35GALVP16, suggesting that
histone deacetylation plays a direct role in mSin3A transcriptional repression (Figure
13B). In addition, there was typically less than a two-fold effect of trapoxin on the
activity of the reporter gene in cells transfected with the expression vector alone or in
cells transfected with GALVP16 (data not shown). Following trapoxin treatment, the
repression observed for MadN35GALVP16 was still seven times greater than that of
Mad(Pro)N35GALVP16, suggesting that the residual deacetylase activity following
trapoxin treatment (Figure 13B) continues to drive mSin3A-mediated repression;
however, we cannot rule out that mSin3A is capable of repression by mechanisms
independent of histone deacetylation.

Furthermore, each of the assay systems set out above can be generated in a
"differential" format as set forth above. That is, the assay format can provide information
regarding specificity as well as potency. For instance, side-by-side comparison of a test
compound's effect on different HDxs can provide information on selectivity, and permit
the identification of compounds which selectively modulate the bioactivity of only a
subset of the HDx family.
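
As a merely illustrative sketch of how selectivity might be expressed in such a
differential format, the Python fragment below computes, for one enzyme of interest, the
fold difference between its IC50 and the IC50 measured side-by-side for each other
enzyme. The function name, the enzyme labels, and the IC50 values are hypothetical and
serve only to show the arithmetic.

```python
def selectivity_ratios(ic50_by_enzyme, target):
    """Fold-selectivity of a compound for `target` over each other enzyme.

    A ratio above 1 means that a higher concentration is needed to inhibit the
    other enzyme, i.e. the compound is selective for the target.
    """
    target_ic50 = ic50_by_enzyme[target]
    return {
        name: ic50 / target_ic50
        for name, ic50 in ic50_by_enzyme.items()
        if name != target
    }


# Hypothetical IC50 values (nM) from a side-by-side deacetylase assay.
measured = {"yeast RPD3": 10.0, "human HD1": 500.0, "insect HDx-related": 40.0}
ratios = selectivity_ratios(measured, target="yeast RPD3")
# ratios == {"human HD1": 50.0, "insect HDx-related": 4.0}
```

A threshold on such ratios, chosen by the practitioner, could then be used to flag
compounds that modulate only a subset of the enzymes tested.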

Furthermore, inhibitors of the enzymatic activity of each of the subject HDx
proteins can be identified using assays derived from measuring the ability of an agent to
inhibit catalytic conversion of a substrate by the subject proteins. For example, the ability
of the subject HDx proteins to deacetylate a histone substrate, such as histone H4 (see
examples), in the presence and absence of a candidate inhibitor, can be determined using
standard enzymatic assays.

A number of methods have been employed in the art for assaying histone
deacetylase activity, and can be incorporated in the drug screening assays of the present
invention. In preferred embodiments, the assay will employ a labeled acetyl group linked
to appropriate histone lysine residues as substrates. In other embodiments, a histone
substrate peptide can be labeled with a group whose signal is dependent on the
simultaneous presence or absence of an acetyl group, e.g., the label can be a fluorogenic
group whose fluorescence is modulated (either quenched or potentiated) by the presence
of the acetyl moiety. Using standard enzymatic analysis, the ability of a test agent to
cause a statistically significant change in substrate conversion by a histone deacetylase
can be measured, and as desirable, inhibition constants, e.g., Ki values, can be calculated.
The histone substrate can be provided as a purified or semi-purified polypeptide or as
part of a cell lysate. Likewise, the histone deacetylase can be provided to the reaction
mixture as a purified or semi-purified polypeptide or as a cell lysate. Accordingly, the
reaction mixtures of the subject method can range from reconstituted protein mixtures
derived with purified preparations of histones and deacetylases, to mixtures of cell
lysates, e.g., by admixing baculovirus lysates containing recombinant histones and
deacetylases.
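
Merely to illustrate one way in which an inhibition constant might be calculated from
such measurements, the Python sketch below applies the Cheng-Prusoff relation,
Ki = IC50 / (1 + [S]/Km). The relation assumes a reversible, competitive inhibitor, an
assumption that the present description does not make; the function name and the
numerical values are likewise hypothetical.

```python
def ki_competitive(ic50, substrate_conc, km):
    """Cheng-Prusoff estimate of Ki for a reversible, competitive inhibitor.

    ic50, substrate_conc and km must share consistent concentration units;
    ic50 and the returned Ki share the same unit.
    """
    if km <= 0:
        raise ValueError("Km must be positive")
    return ic50 / (1.0 + substrate_conc / km)


# Hypothetical values: IC50 of 100 nM measured with substrate at its Km.
ki = ki_competitive(ic50=100.0, substrate_conc=20.0, km=20.0)  # 50.0 nM
```

For inhibitors acting by other mechanisms, for example irreversible inhibitors, a
different kinetic treatment would be required.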

In an exemplary embodiment, the histone substrate for the subject assay is
provided by isolation of radiolabeled histones from metabolically labelled cells. To
illustrate, as described by Hay et al. (1983) J Biol Chem 258:3726-3734, HeLa cells can
be labelled in culture by addition of [3H]acetate (New England Nuclear) to the culture
media. The addition of butyrate, trapoxin or the like can be used to increase the
abundance of acetylated histones in the cells. Radiolabelled histones can be isolated from
the cells by extraction with H2SO4 (Marushige et al. (1966) J Mol Biol 15:160-174).
Briefly, cells are homogenized in buffer, centrifuged to isolate a nuclear pellet, the
subsequently homogenized nuclear pellet centrifuged through sucrose, and the resulting
chromatin pellet extracted by addition of H2SO4 to yield [3H]acetyl-labelled histones. In
an alternate embodiment, nucleosome preparations containing [3H]acetyl-labelled
histones can be isolated from the labelled cells. As described in the art, nucleosomes can
be isolated from cell preparations by sucrose gradient centrifugation (Hay et al. (1983) J
Biol Chem 258:3726-3734; and Noll (1967) Nature 215:360-363), and polynucleosomes
can be prepared by NaCl precipitation from micrococcal nuclease digested cells (Hay et
al., supra). Similar procedures for isolating labelled histones from other cell types,
including yeast, have been described. See, for example, Alonso et al. (1986) Biochem
Biophys Acta 866:161-169; and Kreiger et al. (1974) J Biol Chem 249:332-334. In yet
other embodiments, the histone is generated by recombinant gene expression, and
includes an exogenous tag (e.g., an HA epitope, a poly(his) sequence or the like) which
facilitates its purification from cell extracts. In still other embodiments, whole nuclei can
be isolated from metabolically labelled cells by micrococcal nuclease digestion (Hay et al.,
supra).

In still another embodiment, the deacetylase substrate can be provided as an
acetylated peptide including a sequence corresponding to the sequence about the specific
lysyl residues acetylated on histone, e.g., peptidyl portions of the core histones H2A,
H2B, H3 or H4. Such fragments can be produced by cleavage of acetylated histones
derived from metabolically labelled cells, e.g., such as by treatment with proteolytic
enzymes or cyanogen bromide (Kreiger et al., supra). In other embodiments, the
acetylated peptide can be provided by standard solid phase synthesis using acetylated
lysine residues (Kreiger et al., supra).

Continuing with the illustrative use of [3H]acetyl-labelled histones, the activity of
a histone deacetylase in the subject assays is detected by measuring release of [3H]acetate
by standard scintillant techniques. In a merely illustrative example, a reaction mixture is
provided which comprises a recombinant HDx protein suspended in buffer, along with a
sample of [3H]acetyl-labelled histones and (optionally) a test compound. The reaction
mixture is maintained at a desired temperature and pH, such as 22°C at pH 7.8, for
several hours, and the reaction terminated by boiling or other form of denaturation.
Released [3H]acetate is extracted and counted. For example, the quenched reaction
mixture can be acidified with concentrated HCl, and used to create a biphasic mixture
with ethyl acetate. The resulting two-phase system is thoroughly mixed, centrifuged, and
the ethyl acetate phase collected and counted by standard scintillation methods. Other
methods for detecting acetate release will be easily recognized by those skilled in the art.
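
Merely by way of illustration, scintillation counts obtained in this manner might be
converted to a percent inhibition as sketched below in Python. The sketch assumes three
reactions that are not spelled out in the present description: a reaction containing the
test compound, an uninhibited reaction, and an enzyme-free blank used for background
subtraction. The function name and the counts are hypothetical.

```python
def percent_inhibition(cpm_test, cpm_uninhibited, cpm_blank):
    """Percent inhibition of [3H]acetate release, after background subtraction.

    cpm_test: counts per minute from the reaction containing the test compound.
    cpm_uninhibited: counts from the reaction without test compound.
    cpm_blank: counts from an enzyme-free reaction (assumed background control).
    """
    signal_test = cpm_test - cpm_blank
    signal_full = cpm_uninhibited - cpm_blank
    if signal_full <= 0:
        raise ValueError("uninhibited reaction shows no signal above background")
    return 100.0 * (1.0 - signal_test / signal_full)


# Hypothetical counts from the ethyl acetate phase of three reactions.
inhibition = percent_inhibition(cpm_test=600.0, cpm_uninhibited=1100.0, cpm_blank=100.0)
# inhibition == 50.0
```

Values obtained at several concentrations of the test compound could then be used to
construct a dose response curve.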

In yet another embodiment, the drug screening assay is derived to include a whole
cell recombinantly expressing one or more of a target protein or HDx protein. The ability
of a test agent to alter the activity of the HDx protein can be detected by analysis of the
recombinant cell. For example, agonists and antagonists of the HDx biological activity
can be detected by scoring for alterations in growth or differentiation (phenotype) of the
cell. General techniques for detecting each are well known, and will vary with respect to
the source of the particular reagent cell utilized in any given assay.

For example, quantification of proliferation of cells in the presence and absence of
a candidate agent can be measured with a number of techniques well known in the art,
including simple measurement of population growth curves. For instance, where the
assay involves proliferation in a liquid medium, turbidimetric techniques (i.e. absorbance/
transmittance of light of a given wavelength through the sample) can be utilized. For
example, in the instance where the reagent cell is a yeast cell, measurement of absorbance
of light at a wavelength between 540 and 600 nm can provide a conveniently fast measure
of cell growth. Likewise, ability to form colonies in solid medium (e.g. agar) can be used
to readily score for proliferation. In other embodiments, an HDx substrate protein, such
as a histone, can be provided as a fusion protein which permits the substrate to be
isolated from cell lysates and the degree of acetylation detected. Each of these
techniques is suitable for the high throughput analysis necessary for rapid screening of
large numbers of candidate agents.
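
As a merely illustrative sketch of how turbidimetric readings might be reduced to a
measure of proliferation, the Python fragment below estimates a doubling time from two
absorbance readings. It assumes exponential growth between the two readings and
absorbance values within the linear range of the instrument; the function name, the
time points, and the readings are hypothetical.

```python
import math


def doubling_time(t0, od0, t1, od1):
    """Doubling time from two absorbance readings taken during exponential growth.

    t0, t1: sampling times (same unit as the returned doubling time).
    od0, od1: absorbance readings, e.g. at a wavelength between 540 and 600 nm.
    """
    if t1 <= t0 or od0 <= 0 or od1 <= od0:
        raise ValueError("readings must show growth over a positive time interval")
    growth_rate = (math.log(od1) - math.log(od0)) / (t1 - t0)
    return math.log(2) / growth_rate


# Hypothetical readings (hours, absorbance) with and without a candidate agent.
untreated = doubling_time(0.0, 0.1, 3.0, 0.8)  # three doublings in 3 h
treated = doubling_time(0.0, 0.1, 3.0, 0.2)    # one doubling in 3 h
slowed = treated > untreated
```

A longer doubling time in the presence of the agent, as in this toy case, would be
scored as inhibition of proliferation.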

In addition, where the ability of an agent to cause or reverse a transformed
phenotype is to be assessed, growth in solid media such as agar can further aid in
establishing whether a mammalian cell is transformed.

Additionally, visual inspection of the morphology of the reagent cell can be used
to determine whether the biological activity of the targeted HDx protein has been affected
by the added agent. To illustrate, the ability of an agent to influence an apoptotic
phenotype which is mediated in some way by a recombinant HDx protein can be assessed
by visual microscopy. Likewise, the formation of certain cellular structures as part of
differentiation, such as the formation of neuritic processes, can be visualized under a light
microscope.

The nature of the effect of test agent on reagent cell can be assessed by measuring
levels of expression of specific genes, e.g., by reverse transcription-PCR. Another
method of scoring for effect on HDx activity is by detecting cell-type specific marker
expression through immunofluorescent staining. Many such markers are known in the
art, and antibodies are readily available. For example, the presence of chondroitin
sulphate proteoglycans as well as type-II collagen are correlated with cartilage
production in chondrocytes, and each can be detected by immunostaining. Similarly, the
human kidney differentiation antigen gp160, human aminopeptidase A, is a marker of
kidney induction, and the cytoskeletal protein troponin I is a marker of heart induction.
In yet another embodiment, the alteration of expression of a reporter gene construct
provided in the reagent cell provides a means of detecting the effect on HDx activity. For
example, reporter gene constructs derived using the transcriptional regulatory sequences,
e.g. the promoters, for developmentally regulated genes can be used to drive the
expression of a detectable marker, such as a luciferase gene. In an illustrative
embodiment, the construct is derived using the promoter sequence from a gene expressed
in a particular differentiative phenotype.

It is also deemed to be within the scope of this invention that the recombinant
HDx cells of the present assay can be generated so as to comprise heterologous HDx
proteins (i.e. cross-species expression). For example, HDx proteins from one species can
be expressed in the cells of another under conditions wherein the heterologous protein is
able to rescue loss-of-function mutations in the host cell. For example, the reagent cell
can be a yeast cell in which a human HDx protein (e.g. exogenously expressed) is the
intended target for development of an anti-proliferative agent. To illustrate, the M778
strain, MATa ura3-52 trp1Δ1 his3-200 leu2-1 trk1Δ rpd3Δ::HIS3, described by Vidal et
al. (1991) Mol Cell Biol 6317-6327, which lacks a functional endogenous RPD3 gene,
can be transfected with an expression plasmid including a mammalian HDx gene in order
to complement the RPD3 loss-of-function. For example, the coding sequence for HD1
can be cloned into a pRS integrative plasmid containing a selectable marker (Sikorski et
al. (1989) Genetics 122:19-27), and the resulting construct used to transform the M778
strain. The resulting cells should produce a mammalian HD1 protein which may be
capable of performing at least some of the functions of the yeast RPD3 protein. The HDx
transformed yeast cells can be easier to manipulate than mammalian cells, and can provide
access to certain assay formats, such as turbidity detection methods, which may not be
obtainable with mammalian cells.

Moreover, the combination of the "mammalianized" strain with the strain M537 
(MATa ura3-52 trp1Δ1 his3-200 leu2-1 trk1Δ, Vidal et al., supra) can provide an
exquisitely sensitive cell-based assay for detecting agents which specifically inhibit, for
example, the yeast RPD3 deacetylase.

In another aspect, the invention provides compounds useful for inhibition of 
HDxs. In a preferred embodiment, an HDx inhibitor compound of the invention can be 
represented by the formula A-B-C, in which A is a specificity element for selective 
binding to an HDx, B is a linker element, and C is an electrophilic moiety capable of 
reacting with a nucleophilic moiety of an HDx, with the proviso that the compound is not
butyrate, trapoxin, or trichostatin. 

In another aspect, the invention provides an affinity matrix for binding or
purifying an HDx. In a preferred embodiment, the affinity matrix can be represented by
the formula S-A-B-C, in which S is a solid or insoluble support, and A, B, and C are as
described above. The solid or insoluble support S can be any of a variety of supports,
many of which are known in the art, for synthesis of, or immobilization of, compounds,
e.g., peptides, benzodiazepines, and the like. For a review of solid-supported synthesis,
see, e.g., Hodge et al., Polymer-supported Reactions in Organic Synthesis, John Wiley
& Sons, New York, 1980. The HDx inhibitor moiety A-B-C can be bonded directly to
the support S, or can be bonded to the support S through a linking or spacing moiety, as
is known in the art.

In another aspect, the invention provides a method of inhibiting an HDx. The 
method comprises contacting the HDx with a compound capable of inhibiting HDx
activity, under conditions such that HDx activity is inhibited. In preferred embodiments,
the compounds can be represented by the formula A-B-C, in which A, B, and C are as
described above; with the proviso that the compound is not butyrate, trapoxin, or 
trichostatin. 

In another aspect, the invention provides a method of purifying an HDx. The 
method includes contacting a reaction mixture comprising an HDx with an affinity matrix 
capable of selectively binding to an HDx, and separating at least one other component of
the reaction mixture from the HDx. In a preferred embodiment, the affinity matrix can be 
represented by the formula S-A-B-C, in which S, A, B, and C are as described above. 

In general, the elements A, B, and C of the inhibitor compounds are selected to 
permit selective binding to, and inhibition of, at least one HDx. The elements A, B, and
C can be selected to provide specificity for particular HDxs. For example, a series of
candidate HDx inhibitor compounds can be synthesized, e.g., according to the
combinatorial methods described infra, and the library of candidate compounds screened
against one or more HDxs to determine the compound or compounds with optimal
activity and specificity for a particular HDx.

Thus, in preferred embodiments, the specificity element A is selected such that the 
HDx inhibitor compound binds selectively to an HDx. In general, the specificity element 
A will be selected according to factors such as the binding specificity of the HDx or HDxs
to which the inhibitor compound should bind, ease of synthesis, stability in vivo or in
vitro, and the like. In certain embodiments, the specificity element A is a
cyclotetrapeptidyl moiety. In another embodiment, A is a substituted or unsubstituted
aryl moiety. In yet another embodiment, A is a nonaromatic carbocycle. In still another
embodiment, A is an amino acyl moiety (e.g., a natural or non-natural amino acyl 
moiety). In yet another embodiment, A is a heterocyclyl moiety. 

In preferred embodiments, B is selected from the group consisting of substituted
and unsubstituted C4-C8 alkylidene, C4-C8 alkenylidene, C4-C8 alkynylidene, and D-E-F,
in which D and F are independently absent or C2-C7 alkylidene, C2-C7 alkenylidene, or
C2-C7 alkynylidene, and E is O, S, or NR', in which R' is H, lower alkyl, lower alkenyl,
lower alkynyl, aralkyl, aryl, or heterocyclyl. The element B should be selected to permit
the specificity element A to interact with an HDx such that specific binding occurs, while
poising the electrophilic moiety C for reaction with a nucleophilic moiety of the HDx.

In a preferred embodiment, C is an electrophilic moiety that is approximately 
isosteric with an N-acetyl group (i.e., C has approximately the same steric bulk as an
N-acetyl group). In preferred embodiments, the element C is capable of reacting, covalently
or non-covalently, with a nucleophilic moiety of an HDx. In certain preferred
embodiments, the element C is capable of binding (e.g., by chelation) to a metal ion, e.g.,
a divalent metal ion, e.g., zinc or calcium. In preferred embodiments, C is selected from
the group consisting of α,β-epoxyketones, α,β-epoxythioketones, α,β-epoxysulfoxides,
hydroxamic acids, α-haloketones, α-halothioketones, α-diazoketones,
diazothioketones, vinyl epoxides, trifluoromethylketone, trifluoromethylthioketone,
enones (e.g., of ketones or thioketones), ynones (e.g., of ketones or thioketones),
α,β-aziridinoketones, hydrazones, boronic acids, carboxylates, amides (e.g., -C(O)-amino),
sulfones, aldehyde, alkyl halides, epoxides, and the like.
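
The modular A-B-C description lends itself to simple enumeration of candidate compounds. The following is a minimal sketch, assuming a small hand-picked set of building blocks for each element; the class names come from the lists above, but the particular selections and the function name `enumerate_abc` are illustrative assumptions and do not represent a library disclosed here.

```python
from itertools import product

# Illustrative building-block choices drawn from the classes named in the text.
# The specific selections are assumptions made for this sketch only.
SPECIFICITY_A = ["cyclotetrapeptidyl", "aryl", "nonaromatic carbocycle"]
LINKER_B = ["C4 alkylidene", "C6 alkylidene", "C6 alkenylidene"]
REACTIVE_C = ["alpha,beta-epoxyketone", "hydroxamic acid", "boronic acid"]


def enumerate_abc(a_set, b_set, c_set):
    """Return every (A, B, C) combination of the supplied building blocks."""
    return list(product(a_set, b_set, c_set))


library = enumerate_abc(SPECIFICITY_A, LINKER_B, REACTIVE_C)
print(len(library))  # 3 x 3 x 3 = 27 candidate A-B-C combinations
```

Each tuple names one candidate that could, in principle, be synthesized and screened against one or more HDxs; any provisos on particular combinations would be applied as a filter on this list.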

In accordance with the foregoing, the moieties A, B, and C can illustratively be 
represented by the formulas depicted in Figure 6, in which represents one or more 
substituents selected from the group consisting of amino, halogen, alkyl, alkenyl, alkynyl,
aryl, aralkyl, heterocyclyl, azido, carboxyl, alkoxycarbonyl, hydroxyl, alkoxy, cyano,
trifluoromethyl, and the like; R" is Cj-Cg alkylidene, Cj-Cg alkenylidene, or C^-Cg
alkynylidene; R5 is hydrogen, alkyl, alkoxycarbonyl, aryloxycarbonyl, alkylsulfonyl,
arylsulfonyl, or aryl; is hydrogen, alkyl, aryl, alkoxy, aryloxy, halogen, and the like; R-^
is hydrogen, alkyl, alkenyl, alkynyl, aryl, and the like; R7 is hydrogen, alkyl, aryl, alkoxy,
aryloxy, amino, hydroxylamino, alkoxylamino, halogen, and the like; R8 is hydrogen,
alkyl, halogen, and the like; R9 is hydrogen, alkyl, aryl, hydroxyl, alkoxy, aryloxy, amino,
and the like; X is a good leaving group, e.g., diazo, halogen, a sulfate or sulfonate ester,
e.g., a tosylate or mesylate, and the like; and Y is O or S.

In certain preferred embodiments, an HDx inhibitor compound can be represented 
by the formula A-B-C, in which A is selected from the group consisting of cycloalkyls, 
unsubstituted and substituted aryls, heterocyclyls, amino acyls, and cyclotetrapeptides; B
is selected from the group consisting of substituted and unsubstituted C4-C8 alkylidene,
C4-C8 alkenylidene, C4-C8 alkynylidene, C4-C8 enyne, and D-E-F, in which D and F are
independently absent or a C-C7 alkylidene, a C2-C7 alkenylidene, or a C2-C7
alkynylidene, and E is O, S, or NR', in which R' represents H, a lower alkyl, a lower
alkenyl, a lower alkynyl, an aralkyl, an aryl, or a heterocyclyl; and C is selected from the
group consisting of [structural formulas not reproduced] and B(OH)2 (boronic
acid); in which Z represents O, S, or NR5, and Y, R5, R'^, and R7 are as defined above.
In preferred embodiments, R'^ is hydrogen. In certain preferred embodiments, B is not a
C4-C8 alkylidene. In preferred embodiments, if B is a C4-C8 alkylidene, C is not a
boronic acid. In other preferred embodiments, the inhibitor compound is not trapoxin.

In certain preferred embodiments, an HDx inhibitor compound can be represented 
by the formula A-B-C, in which A is selected from the group consisting of cycloalkyls,
unsubstituted and substituted aryls, heterocyclyls, amino acyls, and cyclotetrapeptides; B
is selected from the group consisting of substituted and unsubstituted C4-C8 alkylidene,
C4-C8 alkenylidene, C4-C8 alkynylidene, C4-C8 enyne, and D-E-F, in which D and F are
independently absent or C^-C^ alkylidene, C2-C7 alkenylidene, or C2-C7 alkynylidene,
and E is O, S, or NR', in which R' represents H, a lower alkyl, a lower alkenyl, a lower
alkynyl, an aralkyl, an aryl, or a heterocyclyl; and C is selected from the group consisting
of [structural formulas not reproduced], in which R9 is as defined above. In preferred
embodiments, B is not a C4-C8 alkylidene. In preferred embodiments, the inhibitor
compound is not trichostatin.

In still another preferred embodiment, an HDx inhibitor compound can be 
represented by the formula A-B-C, in which A is selected from the group consisting of
cycloalkyls, unsubstituted and substituted aryls, heterocyclyls, amino acyls, and
cyclotetrapeptides; B is selected from the group consisting of substituted and
unsubstituted C4-C8 alkylidene, C4-C8 alkenylidene, C4-C8 alkynylidene, C4-C8 enyne,
and D-E-F, in which D and F are independently absent or a C1-C7 alkylidene, a C2-C7
alkenylidene, or a C2-C7 alkynylidene, and E is O, S, or NR', in which R' is H, lower
alkyl, lower alkenyl, lower alkynyl, aralkyl, aryl, or heterocyclyl; and C is
[structural formula not reproduced], in which Y is O or S, and R7 is as defined above.

Certain HDx inhibitor compounds of the present invention may exist in particular 
geometric or stereoisomeric forms. For example, amino acids can contain at least one
chiral center. The present invention contemplates all such compounds, including cis- and 
trans-isomers, R- and S-enantiomers, diastereomers, the racemic mixtures thereof and 
other mixtures thereof, as falling within the scope of the invention. Additional asymmetric 
carbon atoms may be present in a substituent such as an alkyl group. All such isomers, as
well as mixtures thereof, are intended to be included in this invention.

If, for instance, a particular enantiomer of a compound of the present invention is
desired, it may be prepared by asymmetric synthesis, or by derivation with a chiral
auxiliary, where the resulting diastereomeric mixture is separated and the auxiliary group
cleaved to provide the pure desired enantiomer. Alternatively, where the molecule
contains a basic functional group, such as amino, or an acidic functional group, such as
carboxyl, diastereomeric salts can be formed with an appropriate optically-active acid or
base, followed by resolution of the diastereomers thus formed by fractional crystallization
or chromatographic means well known in the art, and subsequent recovery of the pure
enantiomers. 

The term "alkyl" refers to the radical of saturated aliphatic groups, including 
straight-chain alkyl groups, branched-chain alkyl groups, cycloalkyl (alicyclic) groups, 
alkyl substituted cycloalkyl groups, and cycloalkyl substituted alkyl groups. In preferred 
embodiments, a straight chain or branched chain alkyl has 30 or fewer carbon atoms in its
backbone (e.g., C1-C30 for straight chain, C3-C30 for branched chain), and more 
preferably 20 or fewer. Likewise, preferred cycloalkyls have from 4-10 carbon atoms in 
their ring structure, and more preferably have 5, 6 or 7 carbons in the ring structure. 

Unless the number of carbons is otherwise specified, "lower alkyl" as used herein 
means an alkyl group, as defined above, but having from one to ten carbons, more
preferably from one to six carbon atoms in its backbone structure. Likewise, "lower 
alkenyl" and "lower alkynyl" have similar chain lengths. Preferred alkyl groups are lower 
alkyls. In preferred embodiments, a substituent designated herein as alkyl is a lower 
alkyl. 

Moreover, the term "alkyl" (or "lower alkyl") as used throughout the specification
and claims is intended to include both "unsubstituted alkyls" and "substituted alkyls", the
latter of which refers to alkyl moieties having substituents replacing a hydrogen on one or
more carbons of the hydrocarbon backbone. Such substituents can include, for example,
halogen, hydroxyl, carbonyl (such as a carboxylate, alkoxycarbonyl, aryloxycarbonyl,
alkylcarbonyl, arylcarbonyl, aldehyde, and the like), thiocarbonyl (such as a thioacid,
alkoxycarbonyl, and the like), an alkoxyl, unsubstituted amino, mono- or disubstituted
amino, amido, amidine, imine, nitro, azido, sulfhydryl, alkylthio, cyano, trifluoromethyl,
sulfonato, sulfamoyl, sulfonamido, heterocyclyl, aralkyl, or an aromatic or heteroaromatic
moiety. It will be understood by those skilled in the art that the moieties substituted on
the hydrocarbon chain can themselves be substituted, as described above, if appropriate.
Exemplary substituted alkyls are described below. Cycloalkyls can be further substituted
with, e.g., alkyls, alkenyls, alkoxys, alkylthios, aminoalkyls, carbonyl-substituted alkyls,
-CF3, -CN, and the like.
The terms "alkenyl" and "alkynyl" refer to unsaturated aliphatic groups analogous
in length and possible substitution to the alkyls described above, but that contain at least 
one double or triple bond respectively. The term "enyne" refers to an unsaturated 
aliphatic moiety having at least one double bond and one triple bond. 

The terms "alkylidene," "alkenylidene," and "alkynylidene" are art-recognized and
refer to moieties corresponding to alkyl, alkenyl, and alkynyl moieties as defined above,
but having two valences available for bonding.

The term "aryl" as used herein includes 5-, 6- and 7-membered single-ring
aromatic groups that may include from zero to four heteroatoms, for example, phenyl,
pyrrolyl, furanyl, thiophenyl, imidazolyl, oxazolyl, thiazolyl, triazolyl, pyrazolyl, pyridyl,
pyrazinyl, pyridazinyl and pyrimidyl, and the like. Those aryl groups having heteroatoms
in the ring structure may also be referred to as "aryl heterocycles" or "heteroaromatics".
The aromatic ring can be substituted at one or more ring positions with such substituents
as described above, as for example, halogen, azido, alkyl, aralkyl, alkenyl, alkynyl,
cycloalkyl, hydroxyl, amino, nitro, sulfhydryl, imino, amido, carbonyl, carboxyl, silyl,
ether, alkylthio, sulfonyl, sulfonamido, ketone, aldehyde, ester, a heterocyclyl, an
aromatic or heteroaromatic moiety, -CF3, -CN, or the like.

The term "aralkyl", as used herein, refers to an alkyl group substituted with an 
aryl group (e.g., an aromatic or heteroaromatic group). 

The terms "heterocyclyl" or "heterocyclic group" refer to non-aromatic 4- to 10- 
membered ring structures, more preferably 4- to 7-membered rings, which ring structures 
include one to four heteroatoms (e.g., O, N, S, P and the like). Heterocyclyl groups 
include, for example, pyrrolidine, oxolane, thiolane, imidazole, oxazole, piperidine, 
piperazine, morpholine, lactones, lactams such as azetidinones and pyrrolidinones, 
sultams, sultones, and the like. The heterocyclic ring can be substituted at one or more 
positions with such substituents as described above, as for example, halogen, alkyl,
aralkyl, alkenyl, alkynyl, cycloalkyl, hydroxyl, amino, nitro, sulfhydryl, imino, amido,
alkoxycarbonyl, aryloxycarbonyl, carboxyl, silyl, ether, alkylthio, alkylsulfonyl,
arylsulfonyl, ketone (e.g., -C(O)-alkyl or -C(O)-aryl), aldehyde, heterocyclyl, an aryl or
heteroaryl moiety, -CF3, -CN, or the like.

Compounds represented by the formula A-B-C, in which A, B, and C have the 
values described supra, can be synthesized by standard techniques of organic synthesis. 
For example, precursor synthons corresponding to each of the moieties A, B, and C, or 
subunits thereof, can be coupled in linear or convergent syntheses to provide HDx
inhibitor compounds, or compounds readily converted thereto. Syntheses of the HDx
inhibitor compound trichostatin, and related compounds, have been reported; see, e.g.,
Massa, S. et al. (1990) J. Med. Chem. 33:2845-49; Mori, K. and Koseki, K. (1988)
Tetrahedron 44:6013-20; Koseki, K. and Mori, K. European Patent Application EP
331524 A2; Fleming, I. et al. (1983) Tetrahedron 39:841-46. Analogs of trapoxin have
also been synthesized; see, e.g., Yoshida, H. and Sugita, K. (1992) Jpn. J. Cancer Res.
83:324-28.

Thus, in an illustrative synthesis, a compound represented by the formula A-B-C
in which A is a phenyl group, while B and C can have a variety of values, can be
synthesized as shown below:

Scheme I [reaction scheme not reproduced]

According to the Scheme, a functionalized organometallic aryl compound
(MX = organometallic moiety; R is any substituent; X is a leaving group, e.g., halogen)
(e.g., organotin, boronate, aryllithium, cuprate, Grignard reagent, etc.) is alkylated or
acylated to provide functionalized compounds (e.g., the exemplary compounds 1, 2, or 3)
which can be further elaborated to provide compounds with a wide variety of substituents
and carbon backbones. Other A moieties (e.g., specificity elements) can be obtained by
use of appropriate synthons, e.g., by substituting vinylorganometallic compounds for the
organometallic aryl compound of the Scheme (followed by further treatment, e.g.,
reduction, of the vinyl group, if desired, to yield an alkyl A moiety). By way of
illustration, as shown for compound 1, the carbonyl group can be used for elaboration,
e.g., by reduction of the carbonyl group to an alcohol, conversion of the alcohol to a
tosylate, and nucleophilic displacement of the tosylate by an acyl compound (e.g., a
ketone or ester) to provide a chain-lengthened product (Route A), which can be
converted to a C(O)X functionality (e.g., by hydrolysis of an ester and conversion of the
resulting carboxylic acid to an acid chloride). Alternatively, the carbonyl group of 1 can
be used for olefination (Route B), e.g., Horner-Emmons olefination, to provide an
elaborated alkenyl compound. Also, the carbonyl group can be converted to an alkynyl
functionality, e.g., via the Corey-Fuchs procedure, to provide an elaborated alkynyl
compound. For purposes of clarity, only certain chain lengths and functional group
patterns are shown in the scheme; however, the skilled artisan will appreciate that many
other compounds, with a variety of B moieties (i.e., linking moieties), can be synthesized
through analogous procedures. The C(O)X functionality (e.g., an acid chloride where X
is Cl) can be converted to functional groups such as amide, hydrazide,
trifluoromethylketone, enone, epoxide, aziridine, and the like, through methods
conventional in the art. Thus, the synthetic pathways shown in the Scheme provide
access to compounds having a variety of C moieties (e.g., reactive moieties) suitable for
substitution in the subject HDx inhibitors.

In vitro chemical synthesis provides a method for generating libraries of 
compounds that can be screened for ability to bind to or inhibit a target protein, e.g., an
HDx. Although in vitro methods have previously been used in the pharmaceutical
industry to identify potential drugs, recently developed methods have focused on rapidly
and efficiently generating and screening large numbers of compounds and are amenable to
generating HDx inhibitor compound libraries for use in the subject method. The various
approaches to simultaneous preparation and analysis of large numbers of compounds
(herein "combinatorial synthesis") each rely on the fundamental concept of synthesis on a
solid support introduced for peptides by Merrifield in 1963 (Merrifield, R.B. (1963) J Am
Chem Soc 85:2149-2154). Many types of solid matrices have been successfully used in
solid-phase synthesis, and can be selected according to the type of chemistry to be
performed on the immobilized moieties, as is discussed in more detail below.
Several synthetic schemes have been suggested or employed for the combinatorial
synthesis of organic compounds (see, e.g., E.M. Gordon et al., J. Med. Chem. 37:1385-
1401 (1994)).

Multipin Synthesis

One method for combinatorial synthesis of compounds is the multipin synthesis 
method. Briefly, Geysen and co-workers (Geysen et al. (1984) PNAS 81:3998-4002) 
introduced a method for generating compounds by a parallel synthesis on polyacrylic 
acid-grafted polyethylene pins arrayed in the microtitre plate format. In the original
experiments, about 50 nmol of a single compound was covalently linked to the spherical
head of each pin, and interactions of each compound with a receptor or antibody could be
determined in a direct binding assay. The Geysen technique can be used to synthesize
and screen thousands of compounds per week using the multipin method, and the
tethered compounds may be reused in many assays. In subsequent work, the level of
compound loading on individual pins has been increased to as much as 2 μmol/pin by
grafting greater amounts of functionalized acrylate derivatives to detachable pin heads,
and the size of the compound library has been increased (Valerio et al. (1993) Int J Pept
Protein Res 42:1-9). Appropriate linker moieties have also been appended to the pins so
that the compounds may be cleaved from the supports after synthesis for assessment of
purity and evaluation in competition binding or functional bioassays (Bray et al. (1990)
Tetrahedron Lett 31:5811-5814; Valerio et al. (1991) Anal Biochem 197:168-177; Bray
et al. (1991) Tetrahedron Lett 32:6163-6166).

More recent applications of the multipin method have taken advantage of the 
cleavable linker strategy to prepare soluble compound libraries (Maeji et al. (1990) J
Immunol Methods 134:23-33; Gammon et al. (1991) J Exp Med 173:609-617; Mutch et
al. (1991) Pept Res 4:132-137).

Divide-Couple-Recombine

In another embodiment, a variegated library of HDx inhibitor compounds is 
provided on a set of beads utilizing the strategy of divide-couple-recombine (see, e.g.,
Houghten (1985) PNAS 82:5131-5135; and U.S. Patents 4,631,211; 5,440,016;
5,480,971). Briefly, as the name implies, at each synthesis step where degeneracy (e.g., a
plurality of different moieties) is introduced into the library, the beads are divided into as
many separate groups as there are different residues (e.g., functional
groups or other moieties) to be added at that position, the different residues are coupled in
separate reactions, and the beads are recombined into one pool for the next step.
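
The divide-couple-recombine cycle can be illustrated with a short simulation. This is a minimal sketch under simplifying assumptions: beads are modelled as tuples of residue labels, every coupling is taken to be quantitative, and the function name, bead count, and residue labels are hypothetical choices made only for illustration.

```python
import random


def divide_couple_recombine(n_beads, residues, n_steps, seed=0):
    """Simulate split-pool synthesis; each bead is a tuple of coupled residues."""
    rng = random.Random(seed)
    beads = [() for _ in range(n_beads)]
    for _ in range(n_steps):
        rng.shuffle(beads)
        # Divide: deal the pooled beads into one group per residue.
        groups = [beads[i::len(residues)] for i in range(len(residues))]
        # Couple: each group receives a single residue in its own reaction.
        coupled = [
            [bead + (residue,) for bead in group]
            for residue, group in zip(residues, groups)
        ]
        # Recombine: pool all beads again before the next step.
        beads = [bead for group in coupled for bead in group]
    return beads


beads = divide_couple_recombine(n_beads=600, residues=["X", "Y", "Z"], n_steps=3)
print(len(beads), len(set(beads)))  # bead count is preserved; distinct sequences <= 3**3
```

Because every bead sits in exactly one reaction vessel at each step, each bead ends up carrying a single sequence, while the pool as a whole samples the combinations of residues.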

In one embodiment, the divide-couple-recombine strategy can be carried out 
using the so-called "tea bag" method first developed by Houghten, where synthesis
occurs on resin that is sealed inside porous polypropylene bags (Houghten et al. (1986)
PNAS 82:5131-5135). Residues are coupled to the resins by placing the bags in solutions
of the appropriate individual activated monomers, while all common steps such as resin
washing and deprotection (if appropriate) are performed simultaneously in one reaction
vessel. At the end of the synthesis, each bag contains a single compound, and the
compounds may be liberated from the resins using a multiple cleavage apparatus
(Houghten et al. (1986) Int J Pept Protein Res 27:673-678). This technique offers
advantages of considerable synthetic flexibility and has been partially automated
(Beck-Sickinger et al. (1991) Pept Res 4:88-94). Moreover, compounds can be produced in
sufficient quantities (> 500 μmol) for purification and complete characterization if
desired.

Synthesis using the tea-bag approach is useful for the production of a library, 
albeit of limited size, as is illustrated by its use in a range of molecular recognition 
problems including antibody epitope analysis (Houghten et al. (1986) PNAS 82:5131- 
5135), peptide hormone structure-function studies (Beck-Sickinger et al. (1990) Int J 
Pept Protein Res 36:522-530; Beck-Sickinger et al. (1990) Eur J Biochem 194:449-456),
and protein conformational mapping (Zimmerman et al. (1991) Eur J Biochem 200:519- 
528). 

Combinatorial Synthesis on Nontraditional Solid Supports

The search for innovative methods of solid-phase synthesis has led to the 
investigation of alternative polymeric supports to the polystyrene-divinylbenzene matrix 
originally popularized by Merrifield. Cellulose, either in the form of paper disks 
(Blankemeyer-Menge et al. (1988) Tetrahedron Lett 29:5871-5874; Frank et al. (1988)
Tetrahedron 44:6031-6040; Eichler et al. (1989) Collect Czech Chem Commun 54:1746-
1752; Frank, R. (1993) Bioorg Med Chem Lett 3:425-430) or cotton fragments (Eichler
et al. (1991) Pept Res 4:296-307; Schmidt et al. (1993) Bioorg Med Chem Lett 3:441-
446) has been successfully functionalized for peptide synthesis. Typical loadings attained
with cellulose paper range from 1 to 3 μmol/cm2, and HPLC analysis of material cleaved
from these supports indicates a reasonable quality for the synthesized peptides.
Alternatively, peptides may be synthesized on cellulose sheets via non-cleavable linkers
and then used in ELISA-based binding studies (Frank, R. (1992) Tetrahedron 48:9217-
9232). The porous, polar nature of this support may help suppress unwanted nonspecific
protein binding effects. In one convenient configuration synthesis occurs in an 8 x 12
microtiter plate format. Frank has used this technique to map the dominant epitopes of
an antiserum raised against a human cytomegalovirus protein, following the overlapping
peptide screening (Pepscan) strategy of Geysen (Frank, R. (1992) Tetrahedron 48:9217-
9232). Other membrane-like supports that may be used for solid-phase synthesis include
polystyrene-grafted polyethylene films (Berg et al. (1989) J Am Chem Soc 111:8024-
8026).

Combinatorial Libraries by Light-Directed, Spatially Addressable Parallel Chemical
Synthesis 

A scheme of combinatorial synthesis in which the identity of a compound is given 
by its location on a synthesis substrate is termed a spatially-addressable synthesis. In one
embodiment, the combinatorial process is carried out by controlling the addition of a
chemical reagent to specific locations on a solid support (Dower et al. (1991) Annu Rep
Med Chem 26:271-280; Fodor, S.P.A. (1991) Science 251:767; Pirrung et al. (1992)
U.S. Patent No. 5,143,854; Jacobs et al. (1994) Trends Biotechnol 12:19-26). The
technique combines two well-developed technologies: solid-phase synthesis chemistry
and photolithography. The high coupling yields of solid-phase reactions allow efficient
compound synthesis, and the spatial resolution of photolithography affords
miniaturization. The merging of these two technologies is done through the use of
photolabile protecting groups, e.g., amino protecting groups, in the synthetic procedure. 

The key points of this technology are illustrated in Gallop et al. (1994) J Med
Chem 37:1233-1251. A synthesis substrate is prepared for compound synthesis through
the covalent attachment of photolabile nitroveratryloxycarbonyl (NVOC) protected
amino linkers. Light is used to selectively activate a specified region of the synthesis
support for coupling. Removal of the photolabile protecting groups by light
(deprotection) results in activation of selected areas. After activation, the first of a set of
residues, each bearing a photolabile protecting group, is exposed to the entire surface.
Coupling only occurs in regions that were addressed by light in the preceding step. The
reagent solution is removed, and the substrate is again illuminated through a second
mask, activating a different region for reaction with a second protected building block.
The pattern of masks and the sequence of reactants define the products and their
locations. Since this process utilizes photolithography techniques, the number of
compounds that can be synthesized is limited only by the number of synthesis sites that
can be addressed with appropriate resolution. The position of each compound is 
precisely known; hence, its interactions with other molecules can be directly assessed. 
The target can be labeled with a fluorescent reporter group to facilitate the identification 
of specific interactions with individual members of the matrix. 

In a light-directed chemical synthesis, the products depend on the pattern of 
illumination and on the order of addition of reactants. By varying the lithographic 
patterns, many different sets of test compounds can be synthesized in the same number of 
steps; this leads to the generation of many different masking strategies. 
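
How the mask pattern and the order of reactants together fix the product at each site can be shown with a small model. This is a minimal sketch, assuming idealized chemistry: every site starts photoprotected, illumination deprotects exactly the masked sites, coupling is quantitative, and each building block re-protects the site it couples to. The function name, site indices, and building-block labels are hypothetical.

```python
def light_directed_synthesis(n_sites, steps):
    """Build the product at each site from (mask, building_block) steps.

    `mask` is the set of site indices illuminated (deprotected) in that step;
    only those sites couple the building block offered in the same step.
    """
    sites = ["" for _ in range(n_sites)]
    for mask, block in steps:
        for index in mask:
            sites[index] += block
    return sites


# Four sites, four steps: two complementary masks are used for each position.
steps = [
    ({0, 1}, "A"),
    ({2, 3}, "B"),
    ({0, 2}, "C"),
    ({1, 3}, "D"),
]
result = light_directed_synthesis(4, steps)
print(result)  # ['AC', 'AD', 'BC', 'BD']
```

Four coupling steps yield four distinct two-unit products at known positions; changing only the masks, with the same reactants in the same order, gives a different set of products.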

Encoded Combinatorial Libraries

In yet another embodiment, the subject method provides an HDx inhibitor 
compound library provided with an encoded tagging system. A recent improvement in 
the identification of active compounds from combinatorial libraries employs chemical
indexing systems using tags that uniquely encode the reaction steps a given bead has
undergone and, by inference, the structure it carries. Conceptually, this approach mimics
phage display libraries, where activity derives from expressed peptides, but the structures
of the active peptides are deduced from the corresponding genomic DNA sequence. The
first encoding of synthetic combinatorial libraries employed DNA as the code. Two forms
of encoding have been reported: encoding with sequenceable bio-oligomers (e.g.,
oligonucleotides and peptides), and binary encoding with non-sequenceable tags. 

Tagging with sequenceable bio-oligomers 

The principle of using oligonucleotides to encode combinatorial synthetic libraries
was described in 1992 (Brenner et al. (1992) PNAS 89:5381-5383), and an example of
such a library appeared the following year (Needles et al. (1993) PNAS 90:10700-10704).
A combinatorial library of nominally 7^7 (= 823,543) peptides composed of all
combinations of Arg, Gln, Phe, Lys, Val, D-Val and Thr (three-letter amino acid code),
each of which was encoded by a specific dinucleotide (TA, TC, CT, AT, TT, CA and
AC, respectively), was prepared by a series of alternating rounds of peptide and
oligonucleotide synthesis on solid support. In this work, the amine linking functionality
on the bead was specifically differentiated toward peptide or oligonucleotide synthesis by
simultaneously preincubating the beads with reagents that generate protected OH groups
for oligonucleotide synthesis and protected NH2 groups for peptide synthesis (here, in a
ratio of 1:20). When complete, the tags each consisted of 69-mers, 14 units of which
carried the code. The bead-bound library was incubated with a fluorescently labeled
antibody, and beads containing bound antibody that fluoresced strongly were harvested
by fluorescence-activated cell sorting (FACS). The DNA tags were amplified by PCR and
sequenced, and the predicted peptides were synthesized. Following such techniques,
HDx inhibitor compound libraries can be derived and screened using HDxs of the subject
invention.
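
By way of illustration only, the dinucleotide code recited above can be sketched in
Python as follows. The amino acid to dinucleotide assignments are those given in the
text; the function names, the left-to-right reading order of the tag, and the omission
of the non-coding portion of the 69-mer are assumptions made for the sketch and are not
taken from the cited reports.

```python
# Dinucleotide code as recited in the text (one dinucleotide per residue).
CODE = {
    "Arg": "TA", "Gln": "TC", "Phe": "CT", "Lys": "AT",
    "Val": "TT", "D-Val": "CA", "Thr": "AC",
}
# Reverse lookup used when a sequenced tag is translated back to residues.
DECODE = {dinucleotide: residue for residue, dinucleotide in CODE.items()}

PEPTIDE_LENGTH = 7
# Nominal library size: 7 choices at each of 7 positions.
LIBRARY_SIZE = len(CODE) ** PEPTIDE_LENGTH


def encode_peptide(residues):
    """Return the coding region (2 nucleotides per residue) for a peptide."""
    return "".join(CODE[residue] for residue in residues)


def decode_tag(coding_region):
    """Translate a sequenced coding region back into a list of residues.

    Assumes the coding region is read in the same order in which the
    peptide is written (an assumption of this sketch).
    """
    if len(coding_region) % 2 != 0:
        raise ValueError("coding region must contain whole dinucleotides")
    return [DECODE[coding_region[i:i + 2]]
            for i in range(0, len(coding_region), 2)]
```

A heptapeptide thus yields a 14-nucleotide coding region, consistent with the 14
code-carrying units mentioned above.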

It is noted that an alternative approach useful for generating nucleotide-encoded 
synthetic peptide libraries employs a branched linker containing selectively protected OH 
and NH2 groups (Nielsen et al. (1993) J Am Chem Soc 115:9812-9813; and Nielsen et al.
(1994) Methods Compan Methods Enzymol 6:361-371). This approach requires that
equimolar quantities of test peptide and tag co-exist, though this may be a potential 
complication in assessing biological activity, especially with nucleic acid based targets. 

The use of oligonucleotide tags permits exquisitely sensitive tag analysis. Even so,
the method requires careful choice of orthogonal sets of protecting groups required for
alternating co-synthesis of the tag and the library member. Furthermore, the chemical
lability of the tag, particularly the phosphate and sugar anomeric linkages, may limit the
choice of reagents and conditions that can be employed for the synthesis of
non-oligomeric libraries. In preferred embodiments, the libraries employ linkers permitting
selective detachment of the test HDx inhibitor compound library member for bioassay, in
part (as described infra) because assays employing beads limit the choice of targets, and
in part because the tags are potentially susceptible to biodegradation.

Peptides themselves have been employed as tagging molecules for combinatorial
libraries. Two exemplary approaches are described in the art, both of which employ
branched linkers to solid phase upon which coding and ligand strands are alternately
elaborated. In the first approach (Kerr JM et al. (1993) J Am Chem Soc 115:2529-2531),
orthogonality in synthesis is achieved by employing acid-labile protection for the coding
strand and base-labile protection for the ligand strand.

In an alternative approach (Nikolaiev et al. (1993) Pept Res 6:161-170), branched
linkers are employed so that the coding unit and the test peptide are both attached to the
same functional group on the resin. In one embodiment, a linker can be placed between
the branch point and the bead so that cleavage releases a molecule containing both code
and ligand (Pátek et al. (1991) Tetrahedron Lett 32:3891-3894). In another embodiment,
the linker can be placed so that the test peptide can be selectively separated from the
bead, leaving the code behind. This last construct is particularly valuable because it
permits screening of the test peptide without potential interference from, or
biodegradation of, the coding groups. Examples in the art of independent cleavage and
sequencing of peptide library members and their corresponding tags have confirmed that
the tags can accurately predict the peptide structure.

It is noted that peptide tags are more resistant to decomposition during ligand
synthesis than are oligonucleotide tags, but they must be employed in molar ratios nearly
equal to those of the ligand on typical 130 µm beads in order to be successfully
sequenced. As with oligonucleotide encoding, the use of peptides as tags requires
complex protection/deprotection chemistries.

Non-sequenceable tagging: binary encoding

An alternative form of encoding the test peptide library employs a set of
non-sequenceable tagging molecules (e.g., molecules having electrophoric moieties) that are
used as a binary code (Ohlmeyer et al. (1993) PNAS 90:10922-10926). Exemplary tags
are haloaromatic alkyl ethers that are detectable as their trimethylsilyl ethers at less than
femtomolar levels by electron capture gas chromatography (ECGC). Variations in the
length of the alkyl chain, as well as the nature and position of the aromatic halide
substituents, permit the synthesis of at least 40 such tags, which in principle can encode
2^40 (e.g., upwards of 10^12) different molecules. In the original report (Ohlmeyer et al.,
supra) the tags were bound to about 1% of the available amine groups of a peptide
library via a photocleavable o-nitrobenzyl linker. This approach is convenient when
preparing combinatorial libraries of peptides or other amine-containing molecules. A
more versatile system has, however, been developed that permits encoding of essentially
any combinatorial library. Here, the ligand is attached to the solid support via the
photocleavable linker and the tag is attached through a catechol ether linker via carbene
insertion into the bead matrix (Nestler et al. (1994) J Org Chem 59:4723-4724). This
orthogonal attachment strategy permits the selective detachment of library members for
bioassay in solution and subsequent decoding by ECGC after oxidative detachment of the
tag sets.
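
The binary code described above can be illustrated by the following minimal Python
sketch. It assumes, for illustration only, that each synthesis step is allotted its own
contiguous block of tags, that the building blocks at a step are numbered from 1 (so
that every coupling leaves at least one detectable tag), and that the presence or
absence of each tag is read as one bit. The function names and the bit allocation are
hypothetical and are not taken from the cited reports.

```python
def encode_history(history, blocks_per_step):
    """Return the set of tag indices (0-based) attached to one bead.

    history[i] is the 1-based number of the building block used at step i;
    blocks_per_step[i] is how many building blocks are available at step i.
    """
    tags = set()
    offset = 0
    for choice, n_blocks in zip(history, blocks_per_step):
        if not 1 <= choice <= n_blocks:
            raise ValueError("building block number out of range")
        width = n_blocks.bit_length()  # bits needed to write 1..n_blocks
        for bit in range(width):
            if (choice >> bit) & 1:
                tags.add(offset + bit)  # tag present means bit is 1
        offset += width
    return tags


def decode_tags(tags, blocks_per_step):
    """Recover the synthesis history from the set of detected tags."""
    history = []
    offset = 0
    for n_blocks in blocks_per_step:
        width = n_blocks.bit_length()
        choice = sum(1 << bit for bit in range(width) if offset + bit in tags)
        history.append(choice)
        offset += width
    return history


# Capacity of a 40-tag code, as stated in the text.
CAPACITY_40_TAGS = 2 ** 40
```

Under these assumptions, three steps with seven building blocks each require nine tags
in total, and 40 tags give 2^40 = 1,099,511,627,776 possible codes.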

Binary encoding with tags, e.g., electrophoric tags, has been particularly useful in
defining selective interactions of substrates with synthetic receptors (Borchardt et al.
(1994) J Am Chem Soc 116:373-374), and model systems for understanding the binding
and catalysis of biomolecules. Even using detailed molecular modeling, the identification
of the selectivity preferences for synthetic receptors has required the manual synthesis of
dozens of potential substrates. The use of encoded libraries makes it possible to rapidly
examine all the members of a potential binding set. The use of binary-encoded libraries
has made the determination of binding selectivities so facile that structural selectivity has
been reported for four novel synthetic macrobicyclic and tricyclic receptors in a single
communication (Wennemers et al. (1995) J Org Chem 60:1108-1109; and Yoon et al.
(1994) Tetrahedron Lett 35:8557-8560) using the encoded library mentioned above.
Similar facility in defining specificity of interaction would be expected for many other
biomolecules.

Although the several amide-linked libraries in the art employ binary encoding with
the electrophoric tags attached to amine groups, attaching these tags directly to the bead
matrix provides far greater versatility in the structures that can be prepared in encoded
combinatorial libraries. Attached in this way, the tags and their linker are nearly as
unreactive as the bead matrix itself. Two binary-encoded combinatorial libraries have
been reported where the tags are attached directly to the solid phase (Ohlmeyer et al.
(1995) PNAS 92:6027-6031) and provide guidance for generating the subject HDx
inhibitor compound library. Both libraries were constructed using an orthogonal
attachment strategy in which the library member was linked to the solid support by a
photolabile linker and the tags were attached through a linker cleavable only by vigorous
oxidation. Because the library members can be repetitively partially photoeluted from the
solid support, library members can be utilized in multiple assays. Successive photoelution
also permits a very high throughput iterative screening strategy: first, multiple beads are
placed in 96-well microtiter plates; second, ligands are partially detached and transferred
to assay plates; third, a bioassay identifies the active wells; fourth, the corresponding
beads are rearrayed singly into new microtiter plates; fifth, single active compounds are
identified; and sixth, the structures are decoded.
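
The economy of the six-step strategy above can be seen in the following minimal Python
sketch, which simply counts assays. It is a simulation under stated assumptions: each
bead is represented by an arbitrary identifier, the hypothetical callable is_active
stands in for the bioassay, a pooled well is taken to score as active whenever it holds
at least one active bead, and partial photoelution is modeled only by allowing the same
bead to be assayed twice.

```python
def iterative_screen(beads, is_active, beads_per_well=10):
    """Pool beads in wells, assay the pools, then re-assay beads singly.

    Returns (hits, assays), where hits are the beads found active and
    assays is the total number of wells that had to be assayed.
    """
    # Steps 1-2: distribute multiple beads per well and elute part of each ligand.
    wells = [beads[i:i + beads_per_well]
             for i in range(0, len(beads), beads_per_well)]
    # Step 3: the bioassay identifies the active wells.
    active_wells = [well for well in wells
                    if any(is_active(bead) for bead in well)]
    # Step 4: beads from active wells are rearrayed one per well.
    candidates = [bead for well in active_wells for bead in well]
    # Step 5: single active compounds are identified.
    hits = [bead for bead in candidates if is_active(bead)]
    # Step 6 (decoding of the tags on each hit) is not modeled here.
    assays = len(wells) + len(candidates)
    return hits, assays
```

For 100 beads of which two are active and fall in different wells, ten beads per well
gives 10 pooled assays plus 20 single-bead assays, i.e., 30 assays rather than 100.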

The above approach was employed in screening for carbonic anhydrase (CA)
binding and identified compounds which exhibited nanomolar affinities for CA. Unlike
with sequenceable tagging, a large number of structures can be rapidly decoded from
binary-encoded libraries (a single ECGC apparatus can decode 50 structures per day). Thus,
binary-encoded libraries can be used for the rapid analysis of structure-activity
relationships and optimization of both potency and selectivity of an active series. The
synthesis and screening of large unbiased binary-encoded HDx inhibitor compound
libraries for lead identification, followed by preparation and analysis of smaller focused
libraries for lead optimization, offers a particularly powerful approach to discovery of
HDx inhibitor compounds.

HDx inhibitor compounds can be synthesized on solid support by appropriate
functionalization for attachment to a solid matrix, or alternatively, by solution-phase
synthesis followed by immobilization through an appropriate functional group. Thus, in
an illustrative embodiment, an HDx inhibitor compound, which is analogous to
trichostatin, can be synthesized on a solid support by attachment through an amino group
of the specificity element A, as shown in Figure 7. The solid support is preferably
capable of withstanding synthetic conditions required to synthesize the requisite
compounds. The compound can preferably be released from the solid support, e.g., by
selective cleavage of an amide bond.

The synthetic steps employed to synthesize compounds on solid support are
preferably selected to allow a wide variety of residues (e.g., building blocks) to be
coupled to the immobilized moieties, preferably under mild conditions. Suitable reaction
chemistries include well-known carbon-carbon bond forming reactions such as the Stille
and Suzuki couplings, as well as Horner-Emmons reactions, Ni/Cr mediated couplings,
and the like. Particularly preferred coupling reactions can be performed in the presence
of water and do not require harsh conditions or expensive reagents.

Thus, in an exemplary synthesis shown in Figure 7, substituted N-methyl-4-(tributyltin)anilines
(in which R1 represents one or more substitutions, e.g., hydrogen,
halogen, alkyl, alkoxy, and the like) are coupled in a plurality of reaction vessels to beads
of a solid support (e.g., Affigel). The beads are further divided into a plurality of reaction
vessels, and suspended in a solvent such as DMF, and one acid chloride building block
(corresponding to linking element B) is introduced into each vessel (R2 and R3 represent,
e.g., hydrogen, halogen, alkyl, and the like; and the broken line represents an optional
double bond). The reactions are stirred under an inert gas (e.g., nitrogen) and a palladium
catalyst (e.g., Pd(PPh3)4) is added (0.1-1.0 mol%). The reaction is stirred for 1-24
hours. Upon completion of the reaction, the beads are washed, and placed in a plurality
of vessels. The aldehyde moiety is deprotected by mild acid treatment (e.g., PPTS in
MeOH), and the beads are again washed and placed in a plurality of reaction vessels, and
the beads are suspended in dry acetonitrile. One building block (corresponding to the
reactive element C) is then added to each reaction vessel. As illustratively shown in
Figure 7, a plurality of phosphonates can be employed (R4 represents, e.g., alkyl, alkenyl,
alkynyl, alkoxy, and the like). A Horner-Emmons reaction is performed by addition of
LiCl (1.1 equiv.) and diisopropylethylamine (DIPEA) or DBU (1.2 equiv.). Upon
completion of the reaction, the beads are washed with water and acetonitrile, and then
dried to yield a library of candidate HDx inhibitor compounds on solid support. The
compounds can then be released from the solid support into solution; or the compounds
can be screened while attached to the solid support.

The above combinatorial synthesis can be performed in an encoded mode, e.g.,
the binary tagging method described supra, by addition of the appropriate tag for each
monomer. In this mode, after each reaction has been performed and the corresponding
tag attached, the beads from all reactions can be recombined and then divided into
aliquots for further derivatization. This method provides the advantage of ease of
handling when large libraries are to be synthesized. Regardless of the method of
synthesis, the combinatorial library can be screened for activity according to known
methods (see, e.g., Gordon et al., supra).
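
The size and membership of a library built from the three elements A, B and C can be
illustrated by the following minimal Python sketch. The building block labels (A1, B1,
C1 and so on) and the numbers of blocks per element are hypothetical placeholders chosen
for the example; they do not correspond to particular compounds of Figure 7.

```python
from itertools import product
from math import prod

# Hypothetical building blocks for each element of the compound.
building_blocks = {
    "A": ["A1", "A2", "A3"],        # specificity element (substituted anilines)
    "B": ["B1", "B2"],              # linking element (acid chlorides)
    "C": ["C1", "C2", "C3", "C4"],  # reactive element (phosphonates)
}


def library_size(blocks):
    """Number of distinct compounds: the product of the choices per element."""
    return prod(len(choices) for choices in blocks.values())


def enumerate_library(blocks):
    """List every A-B-C combination that the split synthesis can produce."""
    return ["-".join(combo) for combo in product(*blocks.values())]
```

With three, two and four building blocks respectively, the library contains 3 x 2 x 4 =
24 members; in the encoded mode each member would additionally carry the tags recording
its three building blocks.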

In another aspect, the present invention provides pharmaceutically acceptable
compositions which comprise a therapeutically-effective amount of one or more of the
compounds described above, formulated together with one or more pharmaceutically
acceptable carriers (additives) and/or diluents. As described in detail below, the
pharmaceutical compositions of the present invention may be specially formulated for
administration in solid or liquid form, including those adapted for the following: (1) oral
administration, for example, drenches (aqueous or non-aqueous solutions or
suspensions), tablets, boluses, powders, granules, pastes for application to the tongue; (2)
parenteral administration, for example, by subcutaneous, intramuscular or intravenous
injection as, for example, a sterile solution or suspension; (3) topical application, for
example, as a cream, ointment or spray applied to the skin; or (4) intravaginally or
intrarectally, for example, as a pessary, cream or foam.

The phrase "therapeutically-effective amount" as used herein means that amount
of a compound, material, or composition comprising a deacetylase inhibitor of the present
invention which is effective for producing some desired therapeutic effect by inhibiting
histone deacetylation in at least a sub-population of cells in an animal and thereby
blocking the biological consequences of that event in the treated cells, at a reasonable
benefit/risk ratio applicable to any medical treatment.

The phrase "pharmaceutically acceptable" is employed herein to refer to those
compounds, materials, compositions, and/or dosage forms which are, within the scope of
sound medical judgment, suitable for use in contact with the tissues of human beings and
animals without excessive toxicity, irritation, allergic response, or other problem or
complication, commensurate with a reasonable benefit/risk ratio.

The phrase "pharmaceutically-acceptable carrier" as used herein means a
pharmaceutically-acceptable material, composition or vehicle, such as a liquid or solid
filler, diluent, excipient, solvent or encapsulating material, involved in carrying or
transporting the subject deacetylase inhibitor agent from one organ, or portion of the
body, to another organ, or portion of the body. Each carrier must be "acceptable" in the
sense of being compatible with the other ingredients of the formulation and not injurious
to the patient. Some examples of materials which can serve as pharmaceutically-acceptable
carriers include: (1) sugars, such as lactose, glucose and sucrose; (2) starches,
such as corn starch and potato starch; (3) cellulose, and its derivatives, such as sodium
carboxymethyl cellulose, ethyl cellulose and cellulose acetate; (4) powdered tragacanth;
(5) malt; (6) gelatin; (7) talc; (8) excipients, such as cocoa butter and suppository waxes;
(9) oils, such as peanut oil, cottonseed oil, safflower oil, sesame oil, olive oil, corn oil and
soybean oil; (10) glycols, such as propylene glycol; (11) polyols, such as glycerin,
sorbitol, mannitol and polyethylene glycol; (12) esters, such as ethyl oleate and ethyl
laurate; (13) agar; (14) buffering agents, such as magnesium hydroxide and aluminum
hydroxide; (15) alginic acid; (16) pyrogen-free water; (17) isotonic saline; (18) Ringer's
solution; (19) ethyl alcohol; (20) phosphate buffer solutions; and (21) other non-toxic
compatible substances employed in pharmaceutical formulations.

As set out above, certain embodiments of the present deacetylase inhibitors may
contain a basic functional group, such as amino or alkylamino, and are, thus, capable of
forming pharmaceutically-acceptable salts with pharmaceutically-acceptable acids. The
term "pharmaceutically-acceptable salts" in this respect, refers to the relatively non-toxic,
inorganic and organic acid addition salts of compounds of the present invention. These
salts can be prepared in situ during the final isolation and purification of the compounds
of the invention, or by separately reacting a purified compound of the invention in its free
base form with a suitable organic or inorganic acid, and isolating the salt thus formed.
Representative salts include the hydrobromide, hydrochloride, sulfate, bisulfate,
phosphate, nitrate, acetate, valerate, oleate, palmitate, stearate, laurate, benzoate, lactate,
phosphate, tosylate, citrate, maleate, fumarate, succinate, tartrate, naphthylate, mesylate,
glucoheptonate, lactobionate, and laurylsulphonate salts and the like. (See, for example,
Berge et al. (1977) "Pharmaceutical Salts", J. Pharm. Sci. 66:1-19)

In other cases, the deacetylase inhibitory compounds of the present invention may
contain one or more acidic functional groups and, thus, are capable of forming
pharmaceutically-acceptable salts with pharmaceutically-acceptable bases. The term
"pharmaceutically-acceptable salts" in these instances refers to the relatively non-toxic,
inorganic and organic base addition salts of compounds of the present invention. These
salts can likewise be prepared in situ during the final isolation and purification of the
compounds, or by separately reacting the purified compound in its free acid form with a
suitable base, such as the hydroxide, carbonate or bicarbonate of a
pharmaceutically-acceptable metal cation, with ammonia, or with a pharmaceutically-acceptable organic
primary, secondary or tertiary amine. Representative alkali or alkaline earth salts include
the lithium, sodium, potassium, calcium, magnesium, and aluminum salts and the like.
Representative organic amines useful for the formation of base addition salts include
ethylamine, diethylamine, ethylenediamine, ethanolamine, diethanolamine, piperazine and
the like. (See, for example, Berge et al., supra)

Wetting agents, emulsifiers and lubricants, such as sodium lauryl sulfate and
magnesium stearate, as well as coloring agents, release agents, coating agents,
sweetening, flavoring and perfuming agents, preservatives and antioxidants can also be
present in the compositions.

Examples of pharmaceutically-acceptable antioxidants include: (1) water soluble 
antioxidants, such as ascorbic acid, cysteine hydrochloride, sodium bisulfate, sodium 
metabisulfite, sodium sulfite and the like; (2) oil-soluble antioxidants, such as ascorbyl
palmitate, butylated hydroxyanisole (BHA), butylated hydroxytoluene (BHT), lecithin, 
propyl gallate, alpha-tocopherol, and the like; and (3) metal chelating agents, such as 
citric acid, ethylenediamine tetraacetic acid (EDTA), sorbitol, tartaric acid, phosphoric 
acid, and the like. 

Formulations of the present invention include those suitable for oral, nasal, topical
(including buccal and sublingual), rectal, vaginal and/or parenteral administration. The
formulations may conveniently be presented in unit dosage form and may be prepared by
any methods well known in the art of pharmacy. The amount of active ingredient which
can be combined with a carrier material to produce a single dosage form will vary
depending upon the host being treated and the particular mode of administration. The
amount of active ingredient which can be combined with a carrier material to produce a
single dosage form will generally be that amount of the deacetylase inhibitor which
produces a therapeutic effect. Generally, out of one hundred per cent, this amount will
range from about 1 per cent to about ninety-nine per cent of active ingredient, preferably
from about 5 per cent to about 70 per cent, most preferably from about 10 per cent to
about 30 per cent.

Methods of preparing these formulations or compositions include the step of
bringing into association a compound of the present invention with the carrier and,
optionally, one or more accessory ingredients. In general, the formulations are prepared
by uniformly and intimately bringing into association a deacetylase inhibitor of the present
invention with liquid carriers, or finely divided solid carriers, or both, and then, if
necessary, shaping the product.

Formulations of the invention suitable for oral administration may be in the form 
of capsules, cachets, pills, tablets, lozenges (using a flavored basis, usually sucrose and 
acacia or tragacanth), powders, granules, or as a solution or a suspension in an aqueous 
or non-aqueous liquid, or as an oil-in-water or water-in-oil liquid emulsion, or as an elixir
or syrup, or as pastilles (using an inert base, such as gelatin and glycerin, or sucrose and 
acacia) and/or as mouth washes and the like, each containing a predetermined amount of 
a compound of the present invention as an active ingredient. A deacetylase inhibitor of 
the present invention may also be administered as a bolus, electuary or paste. 

In solid dosage forms of the invention for oral administration (capsules, tablets,
pills, dragees, powders, granules and the like), the active ingredient is mixed with one or
more pharmaceutically-acceptable carriers, such as sodium citrate or dicalcium
phosphate, and/or any of the following: (1) fillers or extenders, such as starches, lactose,
sucrose, glucose, mannitol, and/or silicic acid; (2) binders, such as, for example,
carboxymethylcellulose, alginates, gelatin, polyvinyl pyrrolidone, sucrose and/or acacia;
(3) humectants, such as glycerol; (4) disintegrating agents, such as agar-agar, calcium
carbonate, potato or tapioca starch, alginic acid, certain silicates, and sodium carbonate;
(5) solution retarding agents, such as paraffin; (6) absorption accelerators, such as
quaternary ammonium compounds; (7) wetting agents, such as, for example, cetyl alcohol
and glycerol monostearate; (8) absorbents, such as kaolin and bentonite clay; (9)
lubricants, such as talc, calcium stearate, magnesium stearate, solid polyethylene glycols,
sodium lauryl sulfate, and mixtures thereof; and (10) coloring agents. In the case of
capsules, tablets and pills, the pharmaceutical compositions may also comprise buffering
agents. Solid compositions of a similar type may also be employed as fillers in soft and
hard-filled gelatin capsules using such excipients as lactose or milk sugars, as well as high
molecular weight polyethylene glycols and the like.

A tablet may be made by compression or molding, optionally with one or more
accessory ingredients. Compressed tablets may be prepared using binder (for example,
gelatin or hydroxypropylmethyl cellulose), lubricant, inert diluent, preservative,
disintegrant (for example, sodium starch glycolate or cross-linked sodium carboxymethyl
cellulose), surface-active or dispersing agent. Molded tablets may be made by molding in
a suitable machine a mixture of the powdered deacetylase inhibitor moistened with an
inert liquid diluent.

The tablets, and other solid dosage forms of the pharmaceutical compositions of
the present invention, such as dragees, capsules, pills and granules, may optionally be
scored or prepared with coatings and shells, such as enteric coatings and other coatings
well known in the pharmaceutical-formulating art. They may also be formulated so as to
provide slow or controlled release of the active ingredient therein using, for example,
hydroxypropylmethyl cellulose in varying proportions to provide the desired release
profile, other polymer matrices, liposomes and/or microspheres. They may be sterilized
by, for example, filtration through a bacteria-retaining filter, or by incorporating
sterilizing agents in the form of sterile solid compositions which can be dissolved in sterile
water, or some other sterile injectable medium immediately before use. These
compositions may also optionally contain opacifying agents and may be of a composition
that they release the active ingredient(s) only, or preferentially, in a certain portion of the
gastrointestinal tract, optionally, in a delayed manner. Examples of embedding
compositions which can be used include polymeric substances and waxes. The active
ingredient can also be in micro-encapsulated form, if appropriate, with one or more of the
above-described excipients.

Liquid dosage forms for oral administration of the deacetylase inhibitors of the
invention include pharmaceutically acceptable emulsions, microemulsions, solutions,
suspensions, syrups and elixirs. In addition to the active ingredient, the liquid dosage
forms may contain inert diluents commonly used in the art, such as, for example, water or
other solvents, solubilizing agents and emulsifiers, such as ethyl alcohol, isopropyl
alcohol, ethyl carbonate, ethyl acetate, benzyl alcohol, benzyl benzoate, propylene glycol,
1,3-butylene glycol, oils (in particular, cottonseed, groundnut, corn, germ, olive, castor
and sesame oils), glycerol, tetrahydrofuryl alcohol, polyethylene glycols and fatty acid
esters of sorbitan, and mixtures thereof.

Besides inert diluents, the oral compositions can also include adjuvants such as 
wetting agents, emulsifying and suspending agents, sweetening, flavoring, coloring, 
perfuming and preservative agents.

Suspensions, in addition to the active deacetylase inhibitor, may contain
suspending agents as, for example, ethoxylated isostearyl alcohols, polyoxyethylene
sorbitol and sorbitan esters, microcrystalline cellulose, aluminum metahydroxide,
bentonite, agar-agar and tragacanth, and mixtures thereof.

Formulations of the pharmaceutical compositions of the invention for rectal or
vaginal administration may be presented as a suppository, which may be prepared by
mixing one or more compounds of the invention with one or more suitable nonirritating
excipients or carriers comprising, for example, cocoa butter, polyethylene glycol, a
suppository wax or a salicylate, and which is solid at room temperature, but liquid at
body temperature and, therefore, will melt in the rectum or vaginal cavity and release the
active deacetylase inhibitor.

Formulations of the present invention which are suitable for vaginal administration
also include pessaries, tampons, creams, gels, pastes, foams or spray formulations
containing such carriers as are known in the art to be appropriate.

Dosage forms for the topical or transdermal administration of a deacetylase
inhibitor of this invention include powders, sprays, ointments, pastes, creams, lotions,
gels, solutions, patches and inhalants. The active compound may be mixed under sterile
conditions with a pharmaceutically-acceptable carrier, and with any preservatives, buffers,
or propellants which may be required.

The ointments, pastes, creams and gels may contain, in addition to an active
deacetylase inhibitor of this invention, excipients, such as animal and vegetable fats, oils,
waxes, paraffins, starch, tragacanth, cellulose derivatives, polyethylene glycols, silicones,
bentonites, silicic acid, talc and zinc oxide, or mixtures thereof.

Powders and sprays can contain, in addition to a compound of this invention,
excipients such as lactose, talc, silicic acid, aluminum hydroxide, calcium silicates and
polyamide powder, or mixtures of these substances. Sprays can additionally contain
customary propellants, such as chlorofluorohydrocarbons and volatile unsubstituted
hydrocarbons, such as butane and propane.

Transdermal patches have the added advantage of providing controlled delivery of
a compound of the present invention to the body. Such dosage forms can be made by
dissolving or dispersing the deacetylase inhibitor in the proper medium. Absorption
enhancers can also be used to increase the flux of the deacetylase inhibitor across the
skin. The rate of such flux can be controlled by either providing a rate controlling
membrane or dispersing the deacetylase inhibitor in a polymer matrix or gel.

Ophthalmic formulations, eye ointments, powders, solutions and the like are also 
contemplated as being within the scope of this invention.

Pharmaceutical compositions of this invention suitable for parenteral
administration comprise one or more deacetylase inhibitors of the invention in
combination with one or more pharmaceutically-acceptable sterile isotonic aqueous or
nonaqueous solutions, dispersions, suspensions or emulsions, or sterile powders which
may be reconstituted into sterile injectable solutions or dispersions just prior to use,
which may contain antioxidants, buffers, bacteriostats, solutes which render the
formulation isotonic with the blood of the intended recipient or suspending or thickening
agents.

Examples of suitable aqueous and nonaqueous carriers which may be employed in
the pharmaceutical compositions of the invention include water, ethanol, polyols (such as
glycerol, propylene glycol, polyethylene glycol, and the like), and suitable mixtures
thereof, vegetable oils, such as olive oil, and injectable organic esters, such as ethyl
oleate. Proper fluidity can be maintained, for example, by the use of coating materials,
such as lecithin, by the maintenance of the required particle size in the case of dispersions,
and by the use of surfactants.

These compositions may also contain adjuvants such as preservatives, wetting
agents, emulsifying agents and dispersing agents. Prevention of the action of
microorganisms may be ensured by the inclusion of various antibacterial and antifungal
agents, for example, paraben, chlorobutanol, phenol, sorbic acid, and the like. It may also
be desirable to include isotonic agents, such as sugars, sodium chloride, and the like into
the compositions. In addition, prolonged absorption of the injectable pharmaceutical form
may be brought about by the inclusion of agents which delay absorption such as
aluminum monostearate and gelatin.

In some cases, in order to prolong the effect of a drug, it is desirable to slow the
absorption of the drug from subcutaneous or intramuscular injection. This may be
accomplished by the use of a liquid suspension of crystalline or amorphous material
having poor water solubility. The rate of absorption of the drug then depends upon its
rate of dissolution which, in turn, may depend upon crystal size and crystalline form.
Alternatively, delayed absorption of a parenterally-administered drug form is
accomplished by dissolving or suspending the drug in an oil vehicle.

Injectable depot forms are made by forming microencapsule matrices of the
subject deacetylase inhibitors in biodegradable polymers such as
polylactide-polyglycolide. Depending on the ratio of drug to polymer, and the nature of the particular
polymer employed, the rate of drug release can be controlled. Examples of other
biodegradable polymers include poly(orthoesters) and poly(anhydrides). Depot injectable
formulations are also prepared by entrapping the drug in liposomes or microemulsions
which are compatible with body tissue.

When the compounds of the present invention are administered as
pharmaceuticals, to humans and animals, they can be given per se or as a pharmaceutical
composition containing, for example, 0.1 to 99.5% (more preferably, 0.5 to 90%) of
active ingredient in combination with a pharmaceutically acceptable carrier.

The preparations of the present invention may be given orally, parenterally,
topically, or rectally. They are of course given by forms suitable for each administration
route. For example, they are administered in tablets or capsule form, by injection,
inhalation, eye lotion, ointment, suppository, etc.; administration by injection, infusion or
inhalation; topical by lotion or ointment; and rectal by suppositories. Oral administration
is preferred.

These deacetylase inhibitors may be administered to humans and other animals for
therapy by any suitable route of administration, including orally, nasally, as by, for
example, a spray, rectally, intravaginally, parenterally, intracisternally and topically, as by
powders, ointments or drops, including buccally and sublingually.

Regardless of the route of administration selected, the compounds of the present 
invention, which may be used in a suitable hydrated form, and/or the pharmaceutical
compositions of the present invention, are formulated into pharmaceutically-acceptable
dosage forms by conventional methods known to those of skill in the art. 

Actual dosage levels of the active ingredients in the pharmaceutical compositions 
of this invention may be varied so as to obtain an amount of the active ingredient which is 
effective to achieve the desired therapeutic response for a particular patient, composition, 
and mode of administration, without being toxic to the patient. 

The selected dosage level will depend upon a variety of factors including the 
activity of the particular deacetylase inhibitor employed, or the ester, salt or amide 
thereof, the route of administration, the time of administration, the rate of excretion of 
the particular compound being employed, the duration of the treatment, other drugs,
compounds and/or materials used in combination with the particular deacetylase inhibitor 
employed, the age, sex, weight, condition, general health and prior medical history of the 
patient being treated, and like factors well known in the medical arts. 

A physician or veterinarian having ordinary skill in the art can readily determine
and prescribe the effective amount of the pharmaceutical composition required. For
example, the physician or veterinarian could start doses of the compounds of the
invention employed in the pharmaceutical composition at levels lower than that required
in order to achieve the desired therapeutic effect and gradually increase the dosage until
the desired effect is achieved.

Another aspect of the present invention relates to a method of inducing and/or 
maintaining a differentiated state, enhancing survival, and/or inhibiting (or alternatively 
potentiating) proliferation of a cell, by contacting the cells with an agent which modulates
HDx-dependent transcription. For instance, it is contemplated by the invention that, in
light of the present finding of an apparently broad involvement of HDx proteins in the
control of chromatin structure and, thus, transcription and replication, the subject method
could be used to generate and/or maintain an array of different tissues both in vitro and in
vivo. An "HDx therapeutic," whether inhibitory or potentiating with respect to
modulating histone deacetylation, can be, as appropriate, any of the preparations 
described above, including isolated polypeptides, gene therapy constructs, antisense 
molecules, peptidomimetics or agents identified in the drug assays provided herein. 

The HDx compounds of the present invention are likely to play an important role 
in the modulation of cellular proliferation. There are a wide variety of pathological cell
proliferative conditions for which HDx therapeutics of the present invention may be used
in treatment. For instance, such agents can provide therapeutic benefits where the
general strategy is the inhibition of an anomalous cell proliferation. Diseases that
might benefit from this methodology include, but are not limited to, various cancers and
leukemias, psoriasis, bone diseases, fibroproliferative disorders such as those involving
connective tissues, atherosclerosis and other smooth muscle proliferative disorders, as
well as chronic inflammation.

In addition to proliferative disorders, the present invention contemplates the use 
of HDx therapeutics for the treatment of differentiative disorders which result from, for
example, de-differentiation of tissue which may (optionally) be accompanied by abortive
reentry into mitosis, e.g. apoptosis. Such degenerative disorders include chronic
neurodegenerative diseases of the nervous system, including Alzheimer's disease,
Parkinson's disease, Huntington's chorea, amyotrophic lateral sclerosis and the like, as
well as spinocerebellar degenerations. Other differentiative disorders include, for
example, disorders associated with connective tissue, such as may occur due to
de-differentiation of chondrocytes or osteocytes, as well as vascular disorders which involve
de-differentiation of endothelial tissue and smooth muscle cells, gastric ulcers
characterized by degenerative changes in glandular cells, and renal conditions marked by
failure to differentiate, e.g. Wilms' tumors.

It will also be apparent that, by transient use of modulators of HDx activities, in
vivo reformation of tissue can be accomplished, e.g. in the development and maintenance
of organs. By controlling the proliferative and differentiative potential for different cells,
the subject HDx therapeutics can be used to reform injured tissue, or to improve grafting
and morphology of transplanted tissue. For instance, HDx antagonists and agonists can
be employed in a differential manner to regulate different stages of organ repair after
physical, chemical or pathological insult. For example, such regimens can be utilized in
repair of cartilage, increasing bone density, liver repair subsequent to a partial
hepatectomy, or to promote regeneration of lung tissue in the treatment of emphysema.
The present method is also applicable to cell culture techniques.

In one embodiment, the HDx therapeutic of the present invention can be used to 
induce differentiation of uncommitted progenitor cells and thereby give rise to a 
committed progenitor cell, or to cause further restriction of the developmental fate of a 
committed progenitor cell towards becoming a terminally-differentiated cell. For 
example, the present method can be used in vitro or in vivo to induce and/or maintain the
differentiation of hematopoietic cells into erythrocytes and other cells of the
hematopoietic system. In an illustrative embodiment, the effect of erythropoietin (EPO)
on the growth of EPO-responsive erythroid precursor cells is increased to influence their
differentiation into red blood cells. For example, as a result of administering an inhibitor
of histone deacetylation, the amount of EPO, or other differentiating agent, required for
growth and/or differentiation is reduced (PCT/US92/07737). Accordingly, the HDx 
therapeutics of the present invention, particularly those which antagonize HDx 
deacetylase activity, can be administered alone or in conjunction with EPO and in a 
suitable carrier to vertebrates to promote erythropoiesis. Alternatively, cells could be 
treated ex vivo. Such treatment is contemplated in the treatment of a variety of disease 
states, including in individuals who require bone marrow transplants (e.g. patients with 
aplastic anemia, acute leukemias, recurrent lymphomas, or solid tumors). 

To illustrate, prior to receiving a bone marrow transplant, a recipient is prepared 
by ablating or removing endogenous hematopoietic stem cells. Such treatment is usually 
carried out by total body irradiation or delivery of a high dose of an alkylating agent or
other chemotherapeutic, cytotoxic agent (Anklesaria et al. (1987) PNAS 84:7681-7685).
Following preparation of the recipient, donor bone marrow cells are injected
intravenously. Optionally, the HDx therapeutics of the present invention could be
contacted with the cells ex vivo or administered to the subject with the reimplanted cells.

It is also contemplated that there may be cell-type specific HDx proteins, and/or
that some cell types may be more sensitive to modulation of HDx deacetylase activities. 
Even within a cell type, the stage of differentiation or position in the cell cycle could 
influence their response to an HDx therapeutic. Accordingly, the present invention 
contemplates the use of agents which modulate histone deacetylase activity to specifically
inhibit or activate certain cell types. In an illustrative example, T cell proliferation could
be preferentially inhibited in order to induce tolerance by using a procedure similar to that
for inducing tolerance using sodium butyrate (see, for example, PCT/US93/03045). To
illustrate, the HDx therapeutics of the present invention may be used to induce
antigen-specific tolerance in any situation in which it is desirable to induce tolerance, such as
autoimmune diseases, in allogeneic or xenogeneic transplant recipients, or in graft versus
host (GVH) reactions. According to the invention, tolerance will typically be induced by
presenting the tolerizing compound (e.g., an HDx inhibitor) substantially
contemporaneously with the antigen, i.e. reasonably close together in time with the
antigen. In preferred embodiments the HDx therapeutic will be administered after
presentation of the antigen, so that it will have its effect after the particular
repertoire of Th cells begins to undergo clonal expansion.

Yet another aspect of the present invention concerns the application of HDx 
therapeutics to modulating morphogenic signals involved in organogenic pathways.
Thus, it is contemplated by the invention that compositions comprising HDx therapeutics
can also be utilized for both cell culture and therapeutic methods involving generation 
and maintenance of tissue. 

In a further embodiment of the invention, the subject HDx therapeutics will be 
useful in increasing the amount of protein produced by a cell or recombinant cell. The
cell may include any primary cell isolated from any animal, cultured cells, immortalized
cells, and established cell lines. The animal cells used in the present invention include
cells which intrinsically have an ability to produce a desired protein; cells which are
induced to have an ability to produce a desired protein, for example, by stimulation with
a cytokine such as an interferon or an interleukin; and genetically engineered cells into
which a gene for a desired protein is introduced. The protein produced by the process could
include any peptides or proteins, including peptide hormone or proteinaceous hormones 
such as any useful hormone, cytokine, interleukin, or protein which it may be desirable to 
have in purified form and/or in large quantity. 

Another aspect of the invention features transgenic non-human animals which 
express a heterologous HDx gene of the present invention, or which have had one or
more genomic HDx genes disrupted in at least one of the tissue or cell-types of the
animal. Accordingly, the invention features an animal model for developmental diseases,
which animal has one or more HDx alleles which are mis-expressed. For example, a mouse
can be bred which has one or more HDx alleles deleted or otherwise rendered inactive. 
Such a mouse model can then be used to study disorders arising from mis-expressed HDx 
genes, as well as for evaluating potential therapies for similar disorders. 

Another aspect of the present invention concerns transgenic animals which are 
comprised of cells (of that animal) which contain a transgene of the present invention and 
which preferably (though optionally) express an exogenous HDx protein in one or more 
cells in the animal. An HDx transgene can encode the wild-type form of the protein, or
can encode homologs thereof, including both agonists and antagonists, as well as
antisense constructs. In preferred embodiments, the expression of the transgene is
restricted to specific subsets of cells, tissues or developmental stages utilizing, for
example, cis-acting sequences that control expression in the desired pattern. In the
present invention, such mosaic expression of an HDx protein can be essential for many
forms of lineage analysis and can additionally provide a means to assess the effects of, for
example, lack of HDx expression which might grossly alter development in small patches
of tissue within an otherwise normal embryo. Toward this end, tissue-specific regulatory
sequences and conditional regulatory sequences can be used to control expression of the 
transgene in certain spatial patterns. Moreover, temporal patterns of expression can be 
provided by, for example, conditional recombination systems or prokaryotic 
transcriptional regulatory sequences. 

Genetic techniques which allow the expression of transgenes to be regulated
via site-specific genetic manipulation in vivo are known to those skilled in the art. For
instance, genetic systems are available which allow for the regulated expression of a
recombinase that catalyzes the genetic recombination of a target sequence. As used herein,
the phrase "target sequence" refers to a nucleotide sequence that is genetically 
recombined by a recombinase. The target sequence is flanked by recombinase 
recognition sequences and is generally either excised or inverted in cells expressing 
recombinase activity. Recombinase catalyzed recombination events can be designed such 
that recombination of the target sequence results in either the activation or repression of
expression of one of the subject HDx proteins. For example, excision of a target 
sequence which interferes with the expression of a recombinant HDx gene, such as one 
which encodes an antagonistic homolog or an antisense transcript, can be designed to 
activate expression of that gene. This interference with expression of the protein can
result from a variety of mechanisms, such as spatial separation of the HDx gene from the
promoter element or an internal stop codon. Moreover, the transgene can be made
wherein the coding sequence of the gene is flanked by recombinase recognition sequences
and is initially transfected into cells in a 3' to 5' orientation with respect to the promoter
element. In such an instance, inversion of the target sequence will reorient the subject
gene by placing the 5' end of the coding sequence in an orientation with respect to the
promoter element which allows for promoter driven transcriptional activation.

In an illustrative embodiment, either the cre/loxP recombinase system of 
bacteriophage P1 (Lakso et al. (1992) PNAS 89:6232-6236; Orban et al. (1992) PNAS
89:6861-6865) or the FLP recombinase system of Saccharomyces cerevisiae (O'Gorman
et al. (1991) Science 251:1351-1355; PCT publication WO 92/15694) can be used to
generate in vivo site-specific genetic recombination systems. Cre recombinase catalyzes 
the site-specific recombination of an intervening target sequence located between loxP 
sequences. loxP sequences are 34 base pair nucleotide repeat sequences to which the Cre 
recombinase binds and are required for Cre recombinase mediated genetic recombination. 
The orientation of loxP sequences determines whether the intervening target sequence is 
excised or inverted when Cre recombinase is present (Abremski et al. (1984) J. Biol.
Chem. 259:1509-1514): Cre catalyzes the excision of the target sequence when the loxP
sequences are oriented as direct repeats and catalyzes inversion of the target sequence
when loxP sequences are oriented as inverted repeats.
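
The orientation rule stated above can be illustrated with a toy model. The following Python sketch is offered only as an illustration under stated assumptions: a construct is represented as a list of labelled segments, the two loxP orientations are written with the hypothetical labels "loxP+" and "loxP-", and the function names apply_cre and flip are invented here. It is not a model of recombination kinetics and is not part of the methods of the invention.

```python
def flip(segment):
    """Toggle a hypothetical ' (rev)' tag marking a segment as reverse-oriented."""
    suffix = " (rev)"
    if segment.endswith(suffix):
        return segment[:-len(suffix)]
    return segment + suffix


def apply_cre(construct):
    """Toy model of Cre acting on a construct that contains exactly two loxP sites.

    construct: list of segment labels; loxP sites are written "loxP+" or "loxP-"
    (illustrative notation for the two possible orientations).
    Direct repeats (same orientation): the intervening target is excised and a
    single loxP site remains. Inverted repeats: the intervening target is inverted.
    """
    sites = [i for i, seg in enumerate(construct) if seg.startswith("loxP")]
    if len(sites) != 2:
        raise ValueError("this sketch handles exactly two loxP sites")
    first, second = sites
    left = construct[:first]
    target = construct[first + 1:second]
    right = construct[second + 1:]
    if construct[first] == construct[second]:
        # Direct repeats: excision of the target sequence.
        return left + [construct[first]] + right
    # Inverted repeats: reverse segment order and flip each segment.
    inverted = [flip(seg) for seg in reversed(target)]
    return left + [construct[first]] + inverted + [construct[second]] + right


# A stop sequence between direct repeats is removed, joining promoter and gene.
print(apply_cre(["promoter", "loxP+", "STOP", "loxP+", "HDx"]))
# A gene placed backwards between inverted repeats is reoriented.
print(apply_cre(["promoter", "loxP+", "HDx (rev)", "loxP-"]))
```

In this sketch the first call corresponds to activation by excision of an interfering sequence, and the second to activation by inversion of a coding sequence initially placed in the 3' to 5' orientation.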

Accordingly, genetic recombination of the target sequence is dependent on 
expression of the Cre recombinase. Expression of the recombinase can be regulated by 
promoter elements which are subject to regulatory control, e.g., tissue-specific, 
developmental stage-specific, inducible or repressible by externally added agents. This
regulated control will result in genetic recombination of the target sequence only in cells
where recombinase expression is mediated by the promoter element. Thus, the activation
of expression of a recombinant HDx protein can be regulated via control of recombinase
expression.

Use of the cre/loxP recombinase system to regulate expression of a recombinant 
HDx protein requires the construction of a transgenic animal containing transgenes
encoding both the Cre recombinase and the subject protein. Animals containing both the 
Cre recombinase and a recombinant HDx gene can be provided through the construction 
of "double" transgenic animals. A convenient method for providing such animals is to 
mate two transgenic animals each containing a transgene, e.g., an HDx gene and
recombinase gene.

One advantage derived from initially constructing transgenic animals containing
an HDx transgene in a recombinase-mediated expressible format derives from the
likelihood that the subject protein, whether agonistic or antagonistic, can be deleterious
upon expression in the transgenic animal. In such an instance, a founder population, in
which the subject transgene is silent in all tissues, can be propagated and maintained.
Individuals of this founder population can be crossed with animals expressing the 
recombinase in, for example, one or more tissues and/or a desired temporal pattern. 
Thus, the creation of a founder population in which, for example, an antagonistic HDx 
transgene is silent will allow the study of progeny from that founder in which disruption
of HDx mediated induction in a particular tissue or at certain developmental stages would
result in, for example, a lethal phenotype.
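
The expected yield of progeny carrying both transgenes from such a cross can be estimated with simple Mendelian arithmetic. The sketch below is illustrative only and rests on assumptions not stated in the text: each transgene is taken to sit at a single locus, the two loci are taken to be unlinked, and each parent is taken to carry only one of the two transgenes. The function names are hypothetical.

```python
from fractions import Fraction


def transmission(copies):
    """Probability that a gamete carries a single-locus transgene.

    copies: number of transgene copies at that locus in the parent,
    0 (absent), 1 (hemizygous) or 2 (homozygous).
    """
    if copies not in (0, 1, 2):
        raise ValueError("copies must be 0, 1 or 2")
    return Fraction(copies, 2)


def fraction_double_transgenic(hdx_copies_in_founder, cre_copies_in_partner):
    """Expected fraction of progeny inheriting both transgenes.

    Assumes unlinked loci and that each parent contributes only one of the
    two transgenes, so the two transmission events are independent.
    """
    return transmission(hdx_copies_in_founder) * transmission(cre_copies_in_partner)


# Hemizygous founder crossed with a hemizygous recombinase-expressing animal.
print(fraction_double_transgenic(1, 1))  # 1/4
# Homozygous founder crossed with a hemizygous recombinase-expressing animal.
print(fraction_double_transgenic(2, 1))  # 1/2
```

Under these assumptions, about one quarter of the progeny of two hemizygous parents would be expected to carry both the silent HDx transgene and the recombinase gene.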

Similar conditional transgenes can be provided using prokaryotic promoter 
sequences which require prokaryotic proteins to be simultaneously expressed in order to
facilitate expression of the HDx transgene. Exemplary promoters and the corresponding
trans-activating prokaryotic proteins are given in U.S. Patent No. 4,833,080.

Moreover, expression of the conditional transgenes can be induced by gene 
therapy-like methods wherein a gene encoding the trans-activating protein, e.g. a 
recombinase or a prokaryotic protein, is delivered to the tissue and caused to be 
expressed, such as in a cell-type specific manner. By this method, an HDx transgene 
could remain silent into adulthood until "turned on" by the introduction of the trans- 
activator. 

In an exemplary embodiment, the "transgenic non-human animals" of the 
invention are produced by introducing transgenes into the germline of the non-human 
animal. Embryonic target cells at various developmental stages can be used to introduce
transgenes. Different methods are used depending on the stage of development of the
embryonic target cell. The zygote is the best target for micro-injection. In the mouse, the
male pronucleus reaches the size of approximately 20 micrometers in diameter which
allows reproducible injection of 1-2 pl of DNA solution. The use of zygotes as a target for
gene transfer has a major advantage in that in most cases the injected DNA will be
incorporated into the host gene before the first cleavage (Brinster et al. (1985) PNAS
82:4438-4442). As a consequence, all cells of the transgenic non-human animal will carry
the incorporated transgene. This will in general also be reflected in the efficient
transmission of the transgene to offspring of the founder since 50% of the germ cells will
harbor the transgene. Microinjection of zygotes is the preferred method for incorporating
transgenes in practicing the invention.

Retroviral infection can also be used to introduce HDx transgenes into a
non-human animal. The developing non-human embryo can be cultured in vitro to the
blastocyst stage. During this time, the blastomeres can be targets for retroviral infection
(Jaenisch, R. (1976) PNAS 73:1260-1264). Efficient infection of the blastomeres is
obtained by enzymatic treatment to remove the zona pellucida (Manipulating the Mouse
Embryo, Hogan eds. (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1986)).
The viral vector system used to introduce the transgene is typically a replication-defective
retrovirus carrying the transgene (Jahner et al. (1985) PNAS 82:6927-6931; Van der
Putten et al. (1985) PNAS 82:6148-6152). Transfection is easily and efficiently obtained
by culturing the blastomeres on a monolayer of virus-producing cells (Van der Putten,
supra; Stewart et al. (1987) EMBO J. 6:383-388). Alternatively, infection can be
performed at a later stage. Virus or virus-producing cells can be injected into the
blastocoele (Jahner et al. (1982) Nature 298:623-628). Most of the founders will be
mosaic for the transgene since incorporation occurs only in a subset of the cells which
formed the transgenic non-human animal. Further, the founder may contain various
retroviral insertions of the transgene at different positions in the genome which generally 
will segregate in the offspring. In addition, it is also possible to introduce transgenes into 
the germ line by intrauterine retroviral infection of the midgestation embryo (Jahner et al. 
(1982) supra). 

A third type of target cell for transgene introduction is the embryonic stem cell
(ES). ES cells are obtained from pre-implantation embryos cultured in vitro and fused
with embryos (Evans et al. (1981) Nature 292:154-156; Bradley et al. (1984) Nature
309:255-258; Gossler et al. (1986) PNAS 83:9065-9069; and Robertson et al. (1986)
Nature 322:445-448). Transgenes can be efficiently introduced into the ES cells by DNA
transfection or by retrovirus-mediated transduction. Such transformed ES cells can
thereafter be combined with blastocysts from a non-human animal. The ES cells 
thereafter colonize the embryo and contribute to the germ line of the resulting chimeric 
animal. For review see Jaenisch, R. (1988) Science 240:1468-1474. 

Methods of making HDx knock-out or disruption transgenic animals are also 
generally known. See, for example, Manipulating the Mouse Embryo, (Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Recombinase dependent
knockouts can also be generated, e.g. by homologous recombination to insert
recombinase target sequences flanking portions of an endogenous HDx gene, such that
tissue specific and/or temporal inactivation of an HDx allele can be controlled
as above.

Exemplification

The invention, now being generally described, will be more readily understood by 
reference to the following examples, which are included merely for purposes of 
illustration of certain aspects and embodiments of the present invention and are not 
intended to limit the invention. 



Example I 

Trapoxin is a microbially derived cyclotetrapeptide that inhibits histone 
deacetylation in vivo and causes mammalian cells to arrest in the cell cycle. A trapoxin
affinity matrix was used to isolate two nuclear proteins that copurified with histone
deacetylase activity. Both proteins were identified by peptide microsequencing, and a
cDNA encoding the histone deacetylase catalytic subunit (HD1) was cloned from a
Jurkat T cell library. As the predicted protein is highly similar to the yeast transcriptional
regulator RPD3, this study supports a role for histone deacetylase as a key regulator of
eukaryotic transcription. 

A requirement for a functional histone deacetylase in cell cycle progression has 
been implicated by the discovery that two cytostatic agents, trapoxin and trichostatin 
(Figure 1A), inhibit histone deacetylation in cultured mammalian cells and in fractionated
cell extracts (4). In addition to causing G1 and G2 phase cell cycle arrest, these natural
products alter gene expression and induce certain mammalian cell lines to differentiate.
Whereas sodium butyrate also has these properties, both trapoxin and trichostatin are five 
orders of magnitude more potent. 

Trapoxin is an "irreversible" inhibitor of histone deacetylase activity and its 
molecular structure offers clues as to how it could form a covalent bond with a
nucleophilic active site residue. First, trapoxin contains an electrophilic epoxyketone that
is essential for biological activity (5). Second, the aliphatic epoxyketone side chain is
approximately isosteric with N-acetyl lysine (Figure 1A). Trapoxin likely acts as a
substrate mimic, with the epoxyketone poised to alkylate an active site nucleophile. We
therefore regarded trapoxin as a tool that could reveal the molecular identity of histone
deacetylase, so that its role in transcriptional regulation and cell cycle progression could 
be elucidated.

Tritium-labeled trapoxin was prepared by total synthesis and used to identify
trapoxin binding protein in crude extracts from bovine thymus. We used a charcoal
precipitation assay to detect a specific trapoxin binding activity primarily in the nuclear
fraction of the extracts (6). The binding activity was saturable with nanomolar
concentrations of [3H]trapoxin and was competed by the simultaneous addition of
unlabeled trapoxin. Trichostatin also competed with [3H]trapoxin (for synthesis, see
Example 2), suggesting that both of these compounds exert their cellular effects by
targeting the same molecule.

If trapoxin and trichostatin induce cell cycle arrest by directly inhibiting histone
deacetylase, then the binding and enzymatic activities should copurify. To investigate
this possibility, we fractionated nuclear thymus proteins by ammonium sulfate precipitation
and Mono Q anion exchange chromatography. 

Briefly, thymocytes (~12 g) prepared from fresh bovine thymus were
homogenized in hypotonic lysis buffer [20 mM tris (pH 7.8), 20 mM NaCl, 1 mM EDTA,
10% glycerol, 1 mM PMSF, 1 mM benzamidine, 10 µg/ml each of pepstatin, aprotinin,
and leupeptin] by mechanical disruption and the nuclei were isolated by centrifugation at
3000g. Nuclei were resuspended in lysis buffer and the proteins were extracted with 0.4
M ammonium sulfate. The viscous lysate was sonicated and clarified by centrifugation at
100,000g for one hour. Proteins were then precipitated with 90% saturated ammonium
sulfate and recovered by centrifugation (100,000g, one hour). After thorough dialysis
against Q buffer (25 mM tris pH 8, 10 mM NH4Cl, 0.25 mM EDTA, 10% glycerol), a
portion of the nuclear proteins (~12 mg total protein) was loaded onto a HR 10/10 Mono
Q column (Pharmacia). The column was washed with 25 ml Q buffer and eluted with a
50 ml linear gradient of 10 to 500 mM NH4Cl. The column was further washed with 25
ml 500 mM NH4Cl and 25 ml 1 M NH4Cl. Fractions were assayed for trapoxin binding
and histone deacetylase activities or further purified with
the K-trap affinity matrix. All procedures were done at 4°C.

Two peaks of histone deacetylase activity eluted from the Mono Q column 
between 250 and 350 mM NH4Cl (Figure 1B). Trapoxin binding activity, as revealed by
the charcoal precipitation assay (40 nM [3H]trapoxin), precisely coeluted with the histone
deacetylase peaks. Furthermore, all detectable histone deacetylase activity was abolished
by treatment with either trapoxin or trichostatin (20 nM). Similar results were obtained
with Mono Q fractionated nuclear extracts prepared from human Jurkat T cells.
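
The position of these peaks within the gradient can be estimated from the stated gradient parameters (a 50 ml linear gradient running from 10 to 500 mM ammonium chloride). The Python sketch below is a simplified calculation only: it assumes an ideal linear gradient and ignores column dead volume and gradient delay, and the function names are hypothetical.

```python
def gradient_concentration(volume_ml, start_mm=10.0, end_mm=500.0, gradient_ml=50.0):
    """Salt concentration (mM) after volume_ml of an ideal linear gradient."""
    if not 0 <= volume_ml <= gradient_ml:
        raise ValueError("volume must lie within the gradient")
    return start_mm + (end_mm - start_mm) * volume_ml / gradient_ml


def volume_at_concentration(conc_mm, start_mm=10.0, end_mm=500.0, gradient_ml=50.0):
    """Gradient volume (ml) at which an ideal linear gradient reaches conc_mm."""
    if not start_mm <= conc_mm <= end_mm:
        raise ValueError("concentration must lie within the gradient range")
    return (conc_mm - start_mm) / (end_mm - start_mm) * gradient_ml


# Approximate gradient volumes bracketing the reported activity peaks.
print(round(volume_at_concentration(250.0), 1))  # 24.5
print(round(volume_at_concentration(350.0), 1))  # 34.7
```

Under these idealized assumptions, activity eluting between 250 and 350 mM would be expected roughly between 24.5 ml and 34.7 ml into the 50 ml gradient.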

To purify the histone deacetylase further, we synthesized an affinity matrix based 
on the trapoxin structure. Because trapoxin itself is not amenable to derivatization and
the epoxyketone side chain is indispensable for activity, we chose to replace one of the
phenylalanine residues of trapoxin's cyclic core with a lysine that could then be covalently
linked to a solid support. This molecule, which we call K-trap, was prepared by a
twenty-step synthesis starting with commercially available (R)-proline and (S,S)-threitol
acetonide (Figure 2A) (see Example 3). Synthetic K-trap inhibited [3H]thymidine
incorporation in MG-63 human osteosarcoma cells with a potency approximately one 
tenth that of trapoxin. In vitro histone deacetylase activity was also inhibited potently by 
this compound (complete inactivation at 20 nM) (8). 

K-trap was deprotected with Pd(Ph3P)4 and coupled to an activated agarose 
matrix (Figure 2A). Mono Q fractions containing nuclear proteins from bovine thymus
were incubated with the K-trap affinity matrix and then tested for both trapoxin binding 
and histone deacetylase activity. Both activities were depleted (90%) by treatment with 
the K-trap matrix, yet a control matrix capped with ethanolamine had no effect on either 
activity (8). Bound polypeptides were eluted by boiling the matrix in 1% SDS buffer and 
separated by polyacrylamide gel electrophoresis. In vitro binding experiments with soluble
[3H]trapoxin indicated that the radiolabel is released into solution following protein
denaturation with SDS or guanidinium hydrochloride. Thus, trapoxin binding proteins
were expected to elute from the affinity matrix with SDS. 

The silver stained gel of the affinity matrix eluates revealed six major polypeptides 
with apparent molecular sizes between 45 and 50 kD (Figure 2B). The interaction
between bovine p46-p50 and the K-trap matrix appeared to be specific, because these
proteins were not retained when the incubation was done in the presence of either
trapoxin or trichostatin (Figure 2B), nor were they retained by the ethanolamine-capped
control matrix. The ability of the structurally unrelated histone
deacetylase inhibitor, trichostatin, to prevent p46-p50 from binding to the K-trap matrix
implies that one or more of these polypeptides constitute the biologically relevant protein
target of both trapoxin and trichostatin. When the affinity purification was repeated with
Jurkat nuclear extracts, only two major bands, p50 and p55, were observed by silver
staining (Figure 2B). Recovery of human p50 and p55 was similarly abolished by
trapoxin (Figure 2B) and trichostatin (8). Because the relative intensities of bovine
p46-p49 vary with each protein preparation, we suspect that they are proteolytic fragments
derived from the bovine equivalent of human p55. One of the bands (p50) is common to 
both human and bovine sources. 

Large scale purification of the bovine proteins led to the resolution of two major 
bands of ~46 and ~50 kD in the final preparative electrophoresis step, both of which
were submitted for microsequencing.

To obtain enough trapoxin binding protein for microsequencing, nuclear
ammonium sulfate pellets from 15 bovine thymuses were prepared as described above.
Sedimented proteins were resuspended in and dialyzed against buffer A [20 mM bistris
(pH 7.2), 20 mM NaCl, 10% glycerol] for 12 hours, and brought to pH 5.8 by dialyzing
against buffer A (pH 5.8) for 30 minutes. After centrifugation, the dialysate (~650 mg
protein) was loaded onto a Q Sepharose FF column (2.6 x 10 cm; Pharmacia) and the
column was washed with 120 ml buffer A (pH 5.8). Proteins were eluted with a 400 ml
linear gradient of 20 to 600 mM NaCl in buffer A. Fractions (10 ml; each fraction
contained 1 ml of 1 M tris pH 8 to neutralize the acidic buffer A) were assayed for
trapoxin binding activity. Tween-20 was added to active fractions at a final
concentration of 0.05%, and these fractions were incubated with K-trap affinity matrix
for 16 hours (25 µl per ml Q fraction). After washing the matrix three times with
phosphate buffered saline, bound proteins were eluted by boiling in 40 µl of SDS sample
buffer per 25 µl of matrix. SDS eluates were combined and the proteins resolved by
SDS-PAGE and transferred to a PVDF membrane (Biorad). Staining with Ponceau S revealed
two major bands (46 and 50 kD). The excised bands were proteolytically digested and the
HPLC purified peptide fragments were sequenced at the Harvard Microchemistry Facility.
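
The gradient and matrix volumes given above imply some simple arithmetic, which can be sketched in Python as follows. This is an illustrative sketch only: it assumes an ideal linear gradient and that fraction 1 begins at the start of the gradient (the text does not say where collection began), and the function names are hypothetical rather than part of the described procedure.

```python
def gradient_midpoint_mM(fraction_index, fraction_ml=10.0, gradient_ml=400.0,
                         start_mM=20.0, end_mM=600.0):
    """Nominal NaCl concentration (mM) at the midpoint of one gradient fraction.

    Assumes an ideal linear gradient and that fraction 1 starts at 0 ml of the
    gradient; both are assumptions made for illustration.
    """
    n_fractions = int(gradient_ml / fraction_ml)
    if not 1 <= fraction_index <= n_fractions:
        raise ValueError("fraction index outside the gradient")
    midpoint_ml = (fraction_index - 0.5) * fraction_ml
    return start_mM + (end_mM - start_mM) * midpoint_ml / gradient_ml


def matrix_and_eluate_ul(fraction_ml, matrix_ul_per_ml=25.0,
                         sds_ul_per_25ul_matrix=40.0):
    """Volumes (in microliters) of K-trap matrix and SDS sample buffer for one fraction.

    Uses the stated ratios: 25 ul matrix per ml of Q fraction, and 40 ul of
    SDS sample buffer per 25 ul of matrix.
    """
    matrix_ul = matrix_ul_per_ml * fraction_ml
    sds_ul = sds_ul_per_25ul_matrix * matrix_ul / 25.0
    return matrix_ul, sds_ul


if __name__ == "__main__":
    for index in (1, 20, 40):
        print(index, gradient_midpoint_mM(index))
    print(matrix_and_eluate_ul(10.0))
```

Under these assumptions the 400 ml gradient spans 40 fractions of 10 ml, and a 10 ml fraction calls for 250 µl of matrix eluted in 400 µl of SDS sample buffer.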

The bovine protein of larger molecular size (~50 kD) corresponds to a known
protein, RbAp48 (11), that consists of seven WD repeat domains (12). Originally
identified as a protein that binds to the retinoblastoma gene product (pRb), RbAp48 may
constitute an adaptor subunit that targets the histone deacetylase to specific chromatin 
domains. 

The ~46 kD bovine protein is highly related to the protein encoded by the yeast
RPD3 gene, which has been implicated by several genetic screens as a transcriptional
regulator, but whose biochemical function is unknown (13). Partial cDNA sequences for
the human gene were identified in the expressed sequence tag database (dbEST) and
were used to design polymerase chain reaction (PCR) primers. Briefly, after noting
sequence similarity between peptides derived from the purified bovine trapoxin binding
protein and yeast RPD3, we checked dbEST to see whether any partial sequences for the
human homologue had been reported. Two ESTs (Genbank accession numbers:
D31480 and F07807) were identified whose predicted translation products aligned with
high sequence similarity to NH2- and COOH-terminal regions of HD1, respectively. PCR
primers were designed based on these tags and a one kilobase PCR product was obtained
from a Jurkat cDNA library (Stratagene). A 32P-labeled probe prepared by random
priming was used to screen the Jurkat library, and ten positive clones were isolated. One
of the clones was fully sequenced and found to contain a putative full-length open
reading frame (Figure 3A). The peptide sequences obtained from the purified bovine
protein align with 100% identity to sequences deduced from this coding region (Figure
3A, boxed residues). We call this human protein HD1 (for histone deacetylase), and its
predicted size of 55 kD agrees well with the estimated size of p55 isolated from Jurkat
nuclear extracts using the K-trap affinity matrix (Figure 2B). A dbEST search indicated
the existence of at least two other related human genes.

To determine the relationship between the proteins from bovine thymus (p46-p50)
and the proteins isolated from human Jurkat T cells (p50 and p55), an antiserum
was generated against a peptide specified by the HD1 open reading frame (Figure 3A,
amino acids 319 to 334). Immunoblot analysis of the bovine proteins p46-p49 and the
human protein p55 showed that they all react with the antiserum, providing additional
evidence that these bands correspond to bovine and human HD1 (Figure 3B). A
monoclonal antibody that specifically recognizes RbAp48 was used to confirm the
identity of bovine and human p50. Importantly, neither HD1 nor RbAp48 was detected
when the affinity purification was done in the presence of trapoxin or trichostatin (Figure
3B).

We used affinity purified antibodies directed against a COOH-terminal peptide
(amino acids 467 to 482) to immunoprecipitate HD1 from crude nuclear extracts. The
immunoprecipitates contained histone deacetylase activity that was inhibited by both
trapoxin and trichostatin (Figure 4A). Consistent with the idea that HD1 and RbAp48
form a complex in vivo, the two proteins coprecipitated with the anti-HD1 antibodies
(Figure 4B). Neither HD1, RbAp48, nor the associated histone deacetylase activity was
immunoprecipitated in the presence of the HD1 COOH-terminal peptide (Figure 4A and
4B) (15). HD1, like RbAp48 (11), is detected predominantly in the nucleus by
immunostaining with the aforementioned antibodies (8). Given that HD1 and RbAp48
are the major proteins eluted from the K-trap matrix (Figure 2B), it is likely that they
interact directly with one another.

We extended the results obtained with the endogenous protein by expressing
recombinant FLAG epitope tagged HD1 (HD1-F) in Jurkat T cells. Anti-FLAG
immunoprecipitates from cells transfected with pBJ5/HD1-F contained histone
deacetylase activity that was sensitive to both trapoxin and trichostatin (Figure 4C).
Histone deacetylase activity was not precipitated when the antibody was blocked with
excess FLAG peptide (15). Interestingly, endogenous RbAp48 did not coprecipitate with
overexpressed HD1-F (8), demonstrating that RbAp48 is not required for either histone
deacetylase or trapoxin binding activity. The result is consistent with the idea that
RbAp48 serves a targeting rather than an enzymatic function. Finally, lysates from cells
transfected with pBJ5/HD1-F were incubated with the K-trap affinity matrix in the
presence or absence of trapoxin and trichostatin. Protein immunoblot analysis
demonstrated an interaction between recombinant HD1-F and the K-trap affinity matrix
that was fully competed by nanomolar concentrations of trapoxin or trichostatin (Figure
4D).

HD1 is 60% identical to the protein encoded by the yeast RPD3 gene, which was
isolated in four independent mutant suppressor screens designed to identify
transcriptional repressors (13, 16, 17, 18, 19). No biochemical function for the yeast
protein has previously been postulated. A negative regulator of the TRK2 gene, RPD3 is
necessary for the transcriptional repression of several genes whose expression is
regulated according to specific environmental conditions. Loss of RPD3 also leads to
decreased transcriptional activation of certain genes, but this effect may be indirect (13,
17). Although RPD3 had yet to be implicated in silencing at telomeres or the mating
loci, the fact that silencing is eliminated by point mutations in specific lysine residues near
the NH2-terminus of histones H3 and H4 suggests that lysine deacetylation may
contribute to the maintenance of silenced chromatin (20, 21, 22, 23). Indeed, silencing at
telomeres and the mating loci has been correlated with the presence of hypoacetylated
histones, and sir mutants which are defective in silencing show a corresponding increase
in the extent of histone acetylation at these loci (24). The SIR3 and SIR4 proteins have
been shown to interact with a bacterially expressed histone H4 NH2-terminal domain in
vitro (25), and it is possible that deacetylation of one or more lysine residues is required
for this interaction in vivo. Our results further support a role for histone deacetylase as a
transcriptional regulator and establish a biochemical connection to the genetic studies that
originally characterized RPD3.

How does inhibition of histone deacetylase in mammalian cells lead to G1 and G2
phase cell cycle arrest? One possibility is that specific cell cycle regulatory proteins such
as the cyclin dependent kinase inhibitors are transcriptionally upregulated in response to
histone deacetylase inactivation. Alternatively, cell cycle checkpoints may exist that
monitor histone acetylation or higher-order chromatin structure. It should now be
possible to study the regulation of histone deacetylase during the cell cycle, its substrate
specificity, and the mechanism by which it is targeted to specific regions of the genome.

3H-Trapoxin was prepared from (S,S)-threitol acetonide (9) by total synthesis, as
outlined in Figures 8A-8C.

As shown in Figure 8A, (S,S)-threitol acetonide (9) was monoprotected by
treatment with triisopropylsilyl chloride (TIPSCl) and sodium hydride in tetrahydrofuran
(THF). The free alcohol was then subjected to Swern oxidation. Wittig reaction of the
resulting aldehyde gave compound 10 in good yield for the three steps. Compound 10
was then hydrogenated with deprotection of the primary alcohol, which was then
converted to the bromide 11 in excellent yield. Bromide 11 was converted to the
organocuprate and reacted with (S)-serine β-lactone to yield the benzyloxycarbonyl
(Cbz)-protected amino acid 12.

As shown in Figure 8B, 12 was coupled to tripeptide methyl ester 14, and the
methyl ester was saponified. The amino acid was then cyclized and the silyl protecting 
group was removed to yield cyclotetrapeptide 18 in 51% yield. 

Cyclotetrapeptide 18 was tritiated, as shown in Figure 8C, by oxidation of the
primary alcohol with the Dess-Martin reagent, and the aldehyde was reduced with
tritiated sodium borohydride to provide tritiated 18, which was converted to
[3H]Trapoxin B by tosylation of the primary alcohol, deprotection of the diol, epoxide
ring closure, and oxidation of the secondary alcohol to yield the desired compound.
Non-radiolabelled 18 was converted to [^H]Trapoxin B, via tosylate 19, in 68% overall 
yield. 



Example 2 

K-Trap was prepared from (S,S)-threitol acetonide (9) by total synthesis, as
outlined in Figures 9A-9C. As shown in Figure 9A, monoprotection and Swern
oxidation of 9 yielded the aldehyde as above. Wittig homologation yielded carboxylic
acid 20, which was converted to the mixed anhydride and treated with lithiated
oxazolidinone 21 to provide 22 in excellent yield. Deprotection of the primary alcohol
and conversion to the tosylate were followed by treatment of the potassium enolate with
trisyl azide according to the method of Evans to effect electrophilic azide transfer in good
overall yield and stereoselectivity, providing compound 23. Removal of the chiral
auxiliary and catalytic reduction of the azido function, with hydrogenation of the olefin,
provided amino acid 24, which was N-protected to give the Fmoc derivative 25 in high
overall yield.

Referring to Figure 9B, protected amino acid 25 was coupled to tripeptide methyl
ester 26. The methyl ester was saponified to yield 27, which was cyclized under
high-dilution conditions to provide cyclotetrapeptide 28 in 58% yield.

As shown in Figure 9C, compound 28 was converted to K-trap (29) by
deprotection of the diol, base-promoted epoxide closure, and oxidation of the secondary
alcohol to provide K-trap (29) in good overall yield. The K-trap affinity matrix 30 was
provided by palladium-catalyzed removal of the allyloxycarbonyl (Alloc) group from the
lysine residue of 29, and immobilization on Affigel 10.

Example 3
Histone Deacetylase Activity is Required
for Full Transcriptional Repression by mSin3A



The Mad family of basic region-Helix-Loop-Helix-Leucine Zipper (bHLHZip)
proteins play an important role in controlling cell proliferation and differentiation (for
reviews see: Amati and Land, 1994; Bernards, 1995). Four identified Mad family
members, Mad1, Mxi1, Mad3 and Mad4 (Ayer et al., 1993; Zervos et al., 1993; Hurlin et
al., 1995a), form heterodimers with another bHLHZip protein, Max, to repress
transcription (Ayer et al., 1993; Hurlin et al., 1995a) and are thought to play a negative
role in the control of cell proliferation.

Two mammalian homologs of the Saccharomyces cerevisiae transcriptional
corepressor SIN3, mSin3A and mSin3B, have recently been identified as Mad interacting
proteins and are required for Mad-mediated transcriptional repression (Ayer et al., 1995;
Schreiber-Agus et al., 1995). The most conserved regions of these proteins correspond
to four putative paired amphipathic helix (PAH) motifs, which have been proposed to
constitute protein-protein interaction surfaces (Wang et al., 1990). The second PAH
motif in mSin3A, mSin3B and Sin3p interacts with the mSin3 interaction domain or SID
in the amino terminus of the four Mad family members (Ayer et al., 1995; Schreiber-Agus
et al., 1995; Hurlin et al., 1995a; Kasten et al., 1996). Mad1, Max and mSin3A form
ternary complexes capable of binding DNA (Ayer et al., 1995). Point mutations in the
SID of Mad1 disrupt its ability to bind mSin3A, negate its function as a
transcriptional repressor (Ayer et al., 1995), and eliminate Mad1 function in several
biological assays (Koskinen et al., 1995; Roussel et al., 1996). These findings suggest
that Mad:Max heterocomplexes repress transcription by tethering either mSin3A or
mSin3B to DNA. A chimeric protein fusing the SID of Mad1 to the GAL4 DNA-binding
domain results in repression of simple and complex promoters in a manner that is
dependent on mSin3 binding, suggesting that targeting mSin3 to DNA is necessary for
repression (Ayer et al., 1996). Nevertheless, the molecular mechanism(s) for
mSin3A-mediated repression remain unknown.

As described in Example 1, a mammalian histone deacetylase has been identified,
and cDNAs encoding the protein, histone deacetylase 1 (HD1 or HDAC1 for the
purposes of this example), have been cloned (see also Taunton et al., 1996b). HDAC1 is
approximately 60% identical to the S. cerevisiae RPD3 protein, which is a component of
a yeast histone deacetylase complex (Rundlett et al., 1996). Single mutations in either
RPD3 or SIN3 give the same phenotypes as RPD3/SIN3 double mutants, suggesting that
they function in the same pathway (Stillman et al., 1994). Because Mad family proteins
use mSin3A as a corepressor and Mad1 can repress transcription in wild-type yeast but
not yeast having a null mutation in SIN3 (Kasten et al., 1996) or RPD3 (D.J. Stillman,
personal comm.), it is likely that the mechanism of transcriptional repression by Mad
proteins is conserved between yeast and higher eukaryotes. Consistent with this
hypothesis, the DNA-binding transcription factor YY1 interacts with a mammalian RPD3
homolog to repress transcription of a heterologous reporter gene (Yang et al., 1996).
These results demonstrate that mammalian RPD3-like activity functions in transcriptional
regulation.

Several lines of evidence suggest that the acetylation status of conserved lysines in
the amino terminal domains of histones H3 and H4 plays a role in the regulation of
transcription. In general, histone hyperacetylation correlates with transcriptionally active
or poised genes; conversely, hypoacetylation correlates with transcriptionally repressed
heterochromatin (for reviews see: Turner, 1993; Loidl, 1994; Wolffe, 1996). While little
is known about the targeting and regulation of histone acetyltransferases and
deacetylases, it has been recently shown that several transcriptional coactivators possess
inherent acetyltransferase activity (Brownell et al., 1996; Ogryzko et al., 1996) or
associate with acetyltransferases (Yang et al., 1996b). We report that mSin3A and
HDAC1 associate in vivo and that the histone deacetylase inhibitor trapoxin interferes
with mSin3A-mediated transcriptional repression.

Results

(i) mSin3A is present in cells as a large, stable multiprotein complex.

To study the in vivo function of mSin3A we generated polyclonal antiserum
specific for the PAH2 domain of mSin3A. We tested this antiserum by
immunoprecipitation using nuclear lysates made from the myeloid leukemia cell line U937
that had been metabolically labeled with 35S-methionine. Analysis of immunoprecipitates
showed an intensely-labeled doublet with an apparent molecular weight of 150
kiloDaltons that was present in the anti-mSin3A immunoprecipitates (Figure 10A). This
doublet comigrated with in vitro translated mSin3A, shared identical V8 protease
digestion peptides with in vitro translated Sin3A and was absent from
immunoprecipitations using preimmune serum or immune serum preincubated with the
cognate immunogen (data not shown).

Fractionation of U937 nuclear extracts by size exclusion chromatography
indicated that mSin3A is present in large molecular weight complex(es) (D.E.A.,
unpublished). To address this possibility, we performed immunoprecipitations from
metabolically-labeled U937 cells under conditions that should preserve protein-protein
interactions. In addition to mSin3A, the low-stringency mSin3A immunoprecipitates
contained several labeled polypeptides of apparent molecular weight 250 kDa, 180 kDa,
55 kDa, 50 kDa, 42 kDa, 33-36 kDa and 30 kDa (Figure 10A). These proteins were not
detected in immunoprecipitates using mSin3A antiserum blocked with the cognate
immunogen, suggesting that the proteins detected are specifically associated with
mSin3A. Furthermore, none of these proteins were detected using high-stringency
immunoprecipitation or by western blotting of whole-cell lysates using anti-Sin3A,
suggesting that they do not share epitopes with mSin3A and are not proteolytic
breakdown products of mSin3A (data not shown). All of the associated proteins appear
to be present in substoichiometric amounts to mSin3A, suggesting that mSin3A
complexes are heterogeneous.

To test the stability of the mSin3A complex, we subjected low-stringency mSin3A
immunoprecipitates to different salt concentrations and ionic detergent conditions. The
proteins that remained bound to mSin3A in the immune complex were analyzed by
SDS-PAGE. Under the most stringent conditions we observed only a slight loss of
mSin3A-associated proteins in the immune complex (Figure 10B). One exception to this finding
was the apparently quantitative loss of p42 under slightly-elevated salt concentrations.
These findings demonstrate that the mSin3A complex is stable in vivo and suggest that
some or all of the mSin3A-associated proteins may facilitate mSin3A function as a
transcriptional co-repressor.



(ii) HDAC1 and RbAp48 are components of the mSin3A complex.

Because SIN3 and RPD3 appear to function in the same pathway in yeast and two
components of the mSin3A complex, p50 and p55, are similar in apparent molecular
weight to HDAC1, we hypothesized that HDAC1 or related proteins might be
components of the mSin3A repressor complex. To test this hypothesis, the proteins
bound to mSin3A immune complexes were eluted with ionic detergents and reprecipitated
with affinity purified antibodies specific for an internal peptide of HDAC1. Only two
proteins eluted from the mSin3A complex were re-precipitated by HDAC1 antiserum
(Figure 11A). These polypeptides comigrated with p50 and p55 from the low stringency
mSin3A immunoprecipitation, suggesting that proteins highly related to HDAC1 are
complexed to mSin3A in vivo. p55 comigrates with in vitro translated HDAC1 and is
recognized by an antibody specific for a carboxy-terminal epitope unique to HDAC1.
Another cDNA encoding an HDAC1 homolog, HDAC2, has recently been identified
(Yang et al., 1996a). It is likely that p50 represents HDAC2 (data not shown).

In a reciprocal experiment, we performed low stringency immunoprecipitations
using antiserum specific for an epitope at the carboxy-terminus of HDAC1. HDAC1
immunoprecipitates contain several proteins that were specifically competed with the
immunizing peptide (Figure 11B). A polypeptide doublet that comigrated with mSin3A
was detected in the HDAC1 immunocomplexes (Figure 11B and 11C). To confirm that
the doublet coprecipitating with HDAC1 is mSin3A, the HDAC1 immunocomplex was
eluted and reprecipitated with antiserum specific for mSin3A (Figure 11C). The two
proteins in this precipitate comigrated with mSin3A, confirming that mSin3A and
HDAC1 are associated in vivo.

To determine whether HDAC1 associated with mSin3A in vivo is enzymatically
active, we assayed low-stringency immunoprecipitates for histone deacetylase activity.
We used a synthetic peptide corresponding to the first twenty four amino acids of histone
H4 as a substrate for our deacetylase assay (Taunton et al., 1996b). Low-stringency
anti-mSin3A immunoprecipitates contained deacetylase activity; however, only background
levels of deacetylase activity were detected in the immunoprecipitates if the mSin3A
antiserum was blocked with cognate immunogen (Figure 11D). To confirm the authenticity
of the mSin3A associated activity we treated the immunoprecipitates with synthetic
trapoxin, a specific inhibitor of histone deacetylase activity (Taunton et al., 1996a).
Treatment of mSin3A complexes in vitro with 10 nM trapoxin reduced deacetylation by
approximately 50% (Figure 11D), suggesting that the precipitated deacetylase activity
can be attributed to trapoxin-sensitive histone deacetylases bound to mSin3A.

We detected an interaction between HDAC1 and RbAp48 in vivo (Figure 11B,
11C and Example 1). The low-stringency mSin3A immunoprecipitation shown in Figure
11C also contained a protein that comigrated with RbAp48 (marked with an asterisk) that
was not readily visible on the shorter exposures of low stringency mSin3A
immunoprecipitations (Figure 11A). We have identified RbAp48 in mSin3A
immunoprecipitates from cell extracts of nontransfected cells by western blotting, further
demonstrating that mSin3A and RbAp48 associate in vivo (Figure 12A).

To address further the association of HDAC1 and RbAp48 with mSin3A,
we expressed the mammalian proteins in insect cells using recombinant baculoviruses. To
this end, we expressed recombinant FLAG-epitope tagged HDAC1 (HDAC1-F) that
could be immuno-purified by anti-FLAG antibodies and histidine-tagged mSin3A
(mSin3A-H) that could be purified by nickel affinity (data not shown). HDAC1-F was
immunoprecipitated from infected Sf9 cell extracts by anti-FLAG antibodies in the
presence or absence of mSin3A-H. HDAC1-F was also precipitated by Ni2+-NTA
agarose in a manner that was dependent on coexpression of mSin3A-H (Figure 12B),
demonstrating that a complex between HDAC1 and mSin3A is formed in insect cells
using exogenously expressed human proteins.

Consistent with our finding that RbAp48 is associated with mSin3A and HDAC1
in vivo, we show that baculovirus expressed Flu-epitope tagged RbAp48 (p48-HA) is
specifically precipitated from infected Sf9 cell extracts using anti-FLAG antibody only
when HDAC1-F is coexpressed. Furthermore, p48-HA is specifically retained by
Ni2+-NTA in the presence of mSin3A-H (Figure 12C). Co-expression of p48-HA did not
appear to affect the association between HDAC1-F and mSin3A-H, suggesting that the
regions of interaction are distinct and that all three proteins can associate simultaneously.
These data suggest a direct interaction between mSin3A, HDAC1 and RbAp48 in vivo.

(iii) Transcription repression by mSin3A requires histone deacetylase activity.

To investigate whether histone deacetylation plays a role in mSin3A-mediated
transcriptional repression in vivo, we examined mSin3A-specific repression in the
presence and absence of the histone deacetylase inhibitor trapoxin. 293 cells were
transfected with a luciferase reporter gene construct containing a minimal promoter
consisting of only a TATA box and initiation site derived from the myelomonocytic
growth factor gene (Figure 13A). This reporter has four consensus binding sites for the
DNA binding domain of the S. cerevisiae transcriptional activator GAL4 and therefore is
responsive to chimeric proteins containing the GAL4 DNA binding domain (GALDBD)
(Sterneck et al., 1992). We have used this reporter construct previously to demonstrate
that fusion of the SID repressor region of Mad1 to the GALDBD is necessary for
mSin3A-dependent transcriptional repression. Furthermore, we have shown that fusion
of SID to the potent transcriptional activator GALVP16, MadN35GALVP16, can cancel
the activation function of VP16 in an mSin3A-dependent manner. Consistent with our
previous results (Ayer et al., 1996), MadN35GALVP16 activated transcription from the
reporter gene approximately 100-fold less well than GALVP16 (data not shown). As a
negative control we engineered two proline substitutions into the SID of Mad1,
Mad(Pro); this protein cannot bind mSin3A in vitro (Ayer et al., 1995). Consistent with
an inability to interact with mSin3A, Mad(Pro)GALVP16 is a much less potent repressor
(Figure 13B). In control experiments we have shown that the observed effects require
the presence of GAL4 sites in the promoter and that both MadN35GALVP16 and
Mad(Pro)N35GALVP16 are expressed to equivalent levels in these cells and bind GAL4
sites with similar affinities (data not shown). To test the role of histone deacetylation in
the repression observed in our transfection assays, we first examined the effect of
trapoxin on histone deacetylase activity in 293 cells. As expected, in vivo treatment with
10 nM trapoxin for eight hours reduced deacetylase activity of both crude 293 extracts
and anti-HDAC1 immunopurified complexes by approximately 46% and 58%,
respectively (Figure 13C).

To test the effect of a histone deacetylase inhibitor on MadN35GALVP16 and
Mad(Pro)N35GALVP16 mediated repression, we treated a duplicate set of transfections
with 10 nM trapoxin for eight hours prior to harvest. In the representative experiment
shown, 10 nM trapoxin treatment derepressed the activity of MadN35GALVP16
nine-fold while it had little effect on the activity of Mad(Pro)N35GALVP16, suggesting that
histone deacetylation plays a direct role in mSin3A transcriptional repression (Figure
13B). In addition, there was typically less than a two-fold effect of trapoxin on the
activity of the reporter gene in cells transfected with the expression vector alone or in
cells transfected with GALVP16 (data not shown). Following trapoxin treatment, the
repression observed for MadN35GALVP16 was still seven times greater than that of
Mad(Pro)N35GALVP16, suggesting that the residual deacetylase activity following
trapoxin treatment (Figure 13B) continues to drive mSin3A-mediated repression;
however, we cannot rule out that mSin3A is capable of repression by mechanisms
independent of histone deacetylation.
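
The fold changes quoted above can be combined with a little arithmetic, sketched here in Python. The sketch is illustrative only: it treats "little effect" on Mad(Pro)N35GALVP16 as a fold change of exactly 1 (an assumption), uses only the ratios stated in the text, and the function name is hypothetical.

```python
def implied_untreated_ratio(fold_derepression, residual_ratio,
                            control_fold_change=1.0):
    """Ratio of control to test reporter activity before inhibitor treatment.

    residual_ratio      = control_after / test_after
    fold_derepression   = test_after / test_before
    control_fold_change = control_after / control_before (assumed 1.0 here)

    Rearranging gives:
        control_before / test_before
            = residual_ratio * fold_derepression / control_fold_change
    """
    if fold_derepression <= 0 or control_fold_change <= 0:
        raise ValueError("fold changes must be positive")
    return residual_ratio * fold_derepression / control_fold_change


if __name__ == "__main__":
    # Nine-fold derepression and a seven-fold residual difference, as stated
    # in the text; the control is assumed to be unaffected by trapoxin.
    print(implied_untreated_ratio(9.0, 7.0))
```

Under that assumption, a nine-fold derepression that leaves a seven-fold difference implies roughly a 63-fold difference between the two constructs before treatment.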

Discussion

Earlier studies implicated mSin3 as the primary candidate for the negative
transcriptional function of the DNA binding transcription factor Mad (Ayer et al., 1995;
Schreiber-Agus et al., 1995; Ayer et al., 1996). We present evidence that the mSin3A
corepressor is part of a high molecular weight, multicomponent complex(es) that contains
active histone deacetylase, thereby implicating histone deacetylation as a potential
mechanism for mSin3A-mediated repression. Furthermore, we observe a pronounced
increase in the transcriptional activity of an mSin3A-silenced reporter gene upon
treatment in vivo with the specific histone deacetylase inhibitor trapoxin, suggesting that
full transcriptional repression by mSin3A requires histone deacetylation. These results
suggest a mechanism of gene regulation through the targeting of an enzyme that alters
chromatin structure.

These observations are consistent with genetic experiments in yeast, suggesting
that the yeast orthologs of mSin3A and HDAC1, SIN3 and RPD3 respectively, are
epistatic transcriptional regulators (Stillman et al., 1994). Furthermore, recent
biochemical evidence demonstrates that Rpd3p is a component of a large molecular
weight histone deacetylase complex in yeast (Rundlett et al., 1996). Together with our
results, these findings predict a conservation of the mSin3/HDAC1 functional association
in yeast.

We have used chimeric transcriptional regulators to discern the effects of trapoxin
on the activity of our reporter genes. The MadN35GALVP16 chimera functioned as a
repressor by a mechanism that was dependent on the binding of mSin3A and that was
sensitive to trapoxin. The same mutations that inactivate MadN35GALVP16 as a
transcriptional repressor (i.e. Mad(Pro)N35GALVP16), also block interaction between
Mad1 and mSin3A in vitro and Mad1 function in vivo. Therefore, it is likely that
Mad:Max heterocomplexes repress transcription in a manner dependent on an
mSin3A-associated histone deacetylase.

By co-immunoprecipitation we have demonstrated that mSin3A and HDAC1
associate in vivo. Consistent with these data we observed nuclear colocalization of
mSin3A and HDAC1 by immunofluorescence microscopy (data not shown). Finally,
overexpression in insect cells facilitates co-purification of mSin3A and HDAC1 (Figure
12B and C), suggesting that the interaction between mSin3A and HDAC1 is either direct
or requires a conserved cofactor. The finding that mSin3A has different associated
histone deacetylases (HDAC1 and HDAC2) suggests that the mSin3A complex(es) may
have multiple substrate or target specificities. The heterogeneous nature of the mSin3A
complex potentially reflects a diverse array of repressors, histone deacetylases and
different targeting molecules that facilitate mSin3A-dependent alterations in gene
expression.

At least five additional polypeptides are stably associated with mSin3A (Figures
10 and 11). Their function is currently unknown, but their tight association with mSin3A in
both U937 cells and Jurkat T cells (data not shown) suggests that they in some way
mediate mSin3A function. Furthermore, we have identified an association between
mSin3A and RbAp48 in vivo, suggesting that this protein may play a role in regulating
mSin3A-targeted deacetylation. RbAp48 was originally identified as a retinoblastoma
binding protein that contains WD repeats and shares homology with the β-subunit of
G-proteins (Qian et al., 1993). Subsequently, it has been shown that RbAp48 or its
orthologs are involved in targeting different histone modifying enzymes to chromatin
(Parthun et al., 1996; Taunton et al., 1996b; Tyler et al., 1996; Verreault et al., 1996).
The mSin3A/RbAp48 complex isolated from U937 cells (Figure 11) is likely to represent
only a small fraction of the mSin3A complexes, but its detection implies that mSin3A may
play a role in the control of different aspects of chromatin physiology as well as
transcription repression.

It is unclear how different chromatin states facilitate transcription repression and
activation or how their distinct biochemical states arise; however, there is ample
cytological, genetic and biochemical evidence supporting the model that hyperacetylated
chromatin is transcriptionally more active than hypoacetylated chromatin. Acetylation
levels in β-heterochromatin of Drosophila melanogaster polytene chromosomes are
significantly reduced at lysine positions 5, 8, and 16 of histone H4, while the
transcriptionally hyperactive X-chromosome of male flies is uniquely hyperacetylated at
position 16 (Turner et al., 1992). In yeast, mutation of acetyl-accepting lysines in histone
H4 reduces the activity of the GAL1, PHO5 and CUP1 promoters in vivo (Durrin et al.,
1991). The transcriptionally silent regions in yeast, HML and HMR, are hypoacetylated
and their activation is correlated with acetylation of histone H4 (Braunstein et al., 1993).
Additionally, biochemical studies showed that certain transcription factors have higher
affinity for their binding sites when those sites are embedded in chromatin assembled from
hyperacetylated histones (Lee et al., 1993; Vettese-Dadey et al., 1996). Finally, evidence
suggesting that acetylation is required for activation comes from the recent demonstration
that several coactivators either encode acetyltransferases or are associated with
acetyltransferases. Thus, our data support this general model for the control of gene
expression by histone acetylation status and provide a biochemical mechanism for
deacetylation-mediated repression.

The acetylation status of a particular chromatin region represents a balance
between competing acetylation and deacetylation reactions. We propose that
MadN35GALVP16 recruits mSin3A-HDAC complexes to specific sites on DNA and
shifts this equilibrium towards deacetylation and subsequent transcription repression by
creating a high effective molarity of the histone deacetylase. In yeast, the activation
domain of VP16 has been shown to use the acetyltransferase Gcn5p as a coactivator
(Marcus et al., 1994; Brownell et al., 1996), suggesting that in mammalian cells VP16
will also use an acetyltransferase as a cofactor. Thus, trapoxin treatment could shift the
equilibrium from deacetylation to acetylation and thereby drive activation.
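
By way of illustration only, the balance described above can be sketched as a two-state model in which a site is acetylated at rate k_acetyl and deacetylated at rate k_deacetyl, so that the steady-state acetylated fraction is k_acetyl / (k_acetyl + k_deacetyl). The two-state model, the function name and all rate values below are assumptions made for this sketch; none of them is measured or reported herein.

```python
def acetylated_fraction(k_acetyl: float, k_deacetyl: float) -> float:
    """Steady-state fraction of acetylated sites in a two-state model.

    Hypothetical model: sites interconvert between an acetylated and a
    deacetylated state with first-order rate constants (arbitrary units).
    """
    if k_acetyl < 0 or k_deacetyl < 0:
        raise ValueError("rate constants must be non-negative")
    total = k_acetyl + k_deacetyl
    if total == 0:
        raise ValueError("at least one rate constant must be positive")
    return k_acetyl / total


# Illustrative, made-up rate constants (not experimental values).
baseline = acetylated_fraction(1.0, 1.0)    # balanced reactions
recruited = acetylated_fraction(1.0, 10.0)  # deacetylase recruited locally
inhibited = acetylated_fraction(1.0, 0.1)   # deacetylase largely inhibited

print(f"baseline={baseline:.3f} recruited={recruited:.3f} inhibited={inhibited:.3f}")
```

In this sketch, raising the effective deacetylase rate lowers the acetylated fraction, and lowering it (as an inhibitor such as trapoxin might) raises the fraction, which is the qualitative behavior proposed in the text.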

Whether histone deacetylation will always have a negative effect on gene
expression is unclear. Mutants in SIN3 and RPD3 can have both positive and negative
effects on gene expression (Vidal and Gaber, 1991; Yoshimoto et al., 1992); however,
for SIN3 there is evidence that positive effects may be indirect (Wang et al., 1994). In
addition, mutations or deletions in RPD3 have recently been shown to enhance telomeric
silencing both in yeast and in fruit fly (Sussel et al., 1995; De Rubertis et al., 1996;
Rundlett et al., 1996). In mammalian cells, deacetylase inhibitors can inhibit MyoD-
(Johnston et al., 1992) and steroid receptor-activated transcription (McKnight et al.,
1990; Bresnick et al., 1990). While it remains to be shown that the effects of RPD3 on
silencing are direct, this evidence suggests that histone deacetylation can elicit both
positive and negative effects on gene expression. Determining the factors that govern the
functional outcome of histone deacetylation will provide fertile ground for further
experimentation.

Experimental Procedures 

Antibodies, cell culture, and immunoprecipitations: To generate antiserum
specific for mSin3A, a GST fusion protein encoding amino acids 251 through 405 of
mSin3A was used to immunize a New Zealand White rabbit. The crude serum was
passed over a GST column to remove the anti-GST antibodies. U937 cells were grown
in RPMI supplemented with 10% calf serum (Hyclone), glutamine and penicillin-
streptomycin. Low and high stringency immunoprecipitations were performed essentially
as described (Ayer and Eisenman, 1993). To elute proteins from low stringency
immunoprecipitates, they were incubated for 60 minutes at room temperature in antibody
buffer and reprecipitated under high stringency conditions.

Luciferase assays: 293 cells were seeded in triplicate onto 60 mm dishes at 3x10^
cells in 4 ml DME with 10% calf serum (Hyclone). Six hours after seeding, cells were
transfected with 50 ng luciferase reporter, 50 ng CMV-β-gal, 50 ng expression construct,
and 2.85 µg carrier DNA using the BBS/CaPO4 method. 10 nM trapoxin was added to
the media 8 hours prior to the luciferase assays. Cell lysates were prepared 20-24 hours
following transfections, and luciferase and β-galactosidase activities were assayed
according to manufacturer directions (Promega, Tropix). Luciferase values (relative light
units) were normalized for transfection efficiency by dividing by β-gal activity.
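
The normalization just described can be sketched as follows. This is a minimal illustration assuming triplicate readings held in plain lists; the function names and every number are hypothetical and are not data from the experiments reported herein.

```python
from statistics import mean


def normalize_luciferase(rlu, beta_gal):
    """Divide each luciferase reading (relative light units) by the
    beta-galactosidase activity measured for the same dish."""
    if len(rlu) != len(beta_gal):
        raise ValueError("each luciferase reading needs a matching beta-gal reading")
    if any(b <= 0 for b in beta_gal):
        raise ValueError("beta-gal activities must be positive")
    return [r / b for r, b in zip(rlu, beta_gal)]


def relative_activity(sample, reference):
    """Mean normalized activity of a sample relative to a reference."""
    return mean(sample) / mean(reference)


# Made-up triplicate readings, for illustration only.
reporter_alone = normalize_luciferase([1200.0, 1100.0, 1300.0], [2.0, 2.0, 2.0])
with_repressor = normalize_luciferase([300.0, 330.0, 270.0], [1.0, 1.1, 0.9])

print(round(relative_activity(with_repressor, reporter_alone), 3))
```

Dividing per dish before averaging keeps a dish with unusually high or low transfection efficiency from dominating the mean.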

Histone deacetylase assays: In vitro histone deacetylase activity was assayed
essentially as described with either 50 µl of crude cell extract (approximately 5 X 10^
cells) or immunopurified cell extracts (approximately 2 X 10^ cells) for 2.5 hrs at 37°C
(Taunton et al., 1996b). Pretreatment of crude or immunopurified extract with synthetic
trapoxin was performed for 30 minutes at 4°C prior to addition of peptide substrate. TAg
Jurkat and 293 cell extracts for histone deacetylase assays were prepared as in Taunton et
al., 1996. Anti-HDAC1 and anti-mSin3A immunoprecipitations were performed as
described above and in Figure 11. The Protein-A conjugated immunoprecipitates were
washed three times in J-buffer plus 1 mM EDTA and resuspended in J-buffer without
Triton-X-100, and histone deacetylase activity was measured as described.

Baculoviruses: cDNAs encoding Flag-tagged HDAC1, HA-tagged RbAp48 and
His-tagged mSin3A were cloned into the transfer vector pVL1392 (specific details on
the construction of these vectors are available upon request). Recombinant virus was
generated using Baculogold DNA according to the manufacturer's instructions
(PharMingen). Sf9 or High 5 cells were infected at high multiplicity, extracts were prepared
48 hours post infection and immunoprecipitations were performed as described above.
Ni-NTA agarose and anti-Flag antibody were purchased from Qiagen and Kodak-IBI,
respectively.
Example 4

Since the priority date of this application, a number of other mammalian HDx
genes have been described in the literature. In particular, a mouse HD1 clone is
identified in GenBank as accession number U807080. Another HDx member, HD2
(HDAC2), is also described for both human and mouse; see, for example, GenBank
entries U31814 and U31758. Without exception, each clone includes a v motif
represented in the general formula of SEQ ID No. 12, and a x motif represented in the
general formula of SEQ ID No. 14.
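
As a purely illustrative sketch, a general formula of this kind can be checked against a candidate sequence by treating each Xaa as a single-residue wildcard. The one-letter strings below are transcribed from SEQ ID No. 14 and from the corresponding 33-residue stretch of SEQ ID No. 5 as given in the sequence listing; the function name and the regular-expression approach are assumptions of this example, not a method described herein.

```python
import re


def formula_to_regex(formula: str) -> "re.Pattern[str]":
    """Turn a one-letter general formula into a regular expression,
    reading every X as 'any single amino acid residue'."""
    parts = ["." if ch == "X" else re.escape(ch) for ch in formula]
    return re.compile("".join(parts))


# One-letter transcription of the general formula of SEQ ID No. 14.
SEQ14_FORMULA = "CVXXXKXFXXPXXXXGGGGYTXRNVARXWXXET"

# 33-residue stretch of SEQ ID No. 5 beginning Cys-Val-Glu-Phe-Val.
HD1_FRAGMENT = "CVEFVKSFNLPMLMLGGGGYTIRNVARCWTYET"

# Hypothetical variant with the conserved Lys replaced, as a negative control.
VARIANT = "CVEFVASFNLPMLMLGGGGYTIRNVARCWTYET"

pattern = formula_to_regex(SEQ14_FORMULA)
print(bool(pattern.fullmatch(HD1_FRAGMENT)))  # formula accommodates the fragment
print(bool(pattern.fullmatch(VARIANT)))       # conserved position changed
```

Using `pattern.search` instead of `pattern.fullmatch` would locate the motif anywhere within a longer polypeptide sequence.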

All of the above-cited references and publications are hereby incorporated by
reference.

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than
routine experimentation, numerous equivalents to the specific polypeptides, nucleic acids,
methods, assays and reagents described herein. Such equivalents are considered to be
within the scope of this invention and are covered by the following claims.

SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT: Schreiber, Stuart L.
Taunton, Jack
Hassig, Christian A.
Jamison, Timothy F.

(ii) TITLE OF INVENTION: Histone Deacetylases and Uses Related Thereto

(iii) NUMBER OF SEQUENCES: 15

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: FOLEY, HOAG & ELIOT, LLP
(B) STREET: One Post Office Square
(C) CITY: Boston
(D) STATE: MA
(E) COUNTRY: USA
(F) ZIP: 02109

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: ASCII (text)

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: US
(B) FILING DATE: 26-MAR-1996
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Vincent, Matthew P.
(B) REGISTRATION NUMBER: 36,709
(C) REFERENCE/DOCKET NUMBER: HUV019.25

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (617) 832-1000
(B) TELEFAX: (617) 832-7000

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1449 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..1446

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

ATG GCG CAG ACG CAG GGC ACC CGG AGG AAA GTC TGT TAC TAC TAC GAC 48
Met Ala Gln Thr Gln Gly Thr Arg Arg Lys Val Cys Tyr Tyr Tyr Asp
1 5 10 15

GGG GAT GTT GGA AAT TAC TAT TAT GGA CAA GGC CAC CCA ATG AAG CCT 96
Gly Asp Val Gly Asn Tyr Tyr Tyr Gly Gln Gly His Pro Met Lys Pro
20 25 30

CAC CGA ATC CGC ATG ACT CAT AAT TTG CTG CTC AAC TAT GGT CTC TAC 144
His Arg Ile Arg Met Thr His Asn Leu Leu Leu Asn Tyr Gly Leu Tyr
35 40 45

CGA AAA ATG GAA ATC TAT CGC CCT CAC AAA GCC AAT GCT GAG GAG ATG 192
Arg Lys Met Glu Ile Tyr Arg Pro His Lys Ala Asn Ala Glu Glu Met
50 55 60

ACC AAG TAC CAC AGC GAT GAC TAC ATT AAA TTC TTG CGC TCC ATC CGT 240
Thr Lys Tyr His Ser Asp Asp Tyr Ile Lys Phe Leu Arg Ser Ile Arg
65 70 75 80

CCA GAT AAC ATG TCG GAG TAC AGC AAG CAG ATG CAG AGA TTC AAC GTT 288
Pro Asp Asn Met Ser Glu Tyr Ser Lys Gln Met Gln Arg Phe Asn Val
85 90 95

GGT GAG GAC TGT CCA GTA TTC GAT GGC CTG TTT GAG TTC TGT CAG TTG 336
Gly Glu Asp Cys Pro Val Phe Asp Gly Leu Phe Glu Phe Cys Gln Leu
100 105 110

TCT ACT GGT GGT TCT GTG GCA ACT GCT GTG AAA CTT AAT AAG CAG CAG 384
Ser Thr Gly Gly Ser Val Ala Ser Ala Val Lys Leu Asn Lys Gln Gln
115 120 125

ACG GAC ATC GCT GTG AAT TGG GCT GGG GGG CTG CAC CAT GCA AAG AAG 432
Thr Asp Ile Ala Val Asn Trp Ala Gly Gly Leu His His Ala Lys Lys
130 135 140

TCC GAG GCA TCT GGC TTC TGT TAC GTC AAT GAT ATC GTC TTG GCC ATC 480
Ser Glu Ala Ser Gly Phe Cys Tyr Val Asn Asp Ile Val Leu Ala Ile
145 150 155 160

CTG GAA CTG CTA AAG TAT CAC CAG AGG GTG CTG TAC ATT GAC ATT GAT 528
Leu Glu Leu Leu Lys Tyr His Gln Arg Val Leu Tyr Ile Asp Ile Asp
165 170 175

ATT CAC CAT GGT GAC GGC GTG GAA GAG GCC TTC TAC ACC ACG GAC CGG 576
Ile His His Gly Asp Gly Val Glu Glu Ala Phe Tyr Thr Thr Asp Arg
180 185 190

GTC ATG ACT GTG TCC TTT CAT AAG TAT GGA GAG TAC TTC CCA GGA ACT 624
Val Met Thr Val Ser Phe His Lys Tyr Gly Glu Tyr Phe Pro Gly Thr
195 200 205

GGG GAC CTA CGG GAT ATC GGG GCT GGC AAA GGC AAG TAT TAT GCT GTT 672
Gly Asp Leu Arg Asp Ile Gly Ala Gly Lys Gly Lys Tyr Tyr Ala Val
210 215 220

AAC TAC CCG CTC CGA GAC GGG ATT GAT GAC GAG TCC TAT GAG GCC ATT 720
Asn Tyr Pro Leu Arg Asp Gly Ile Asp Asp Glu Ser Tyr Glu Ala Ile
225 230 235 240

TTC AAG CCG GTC ATG TCC AAA GTA ATG GAG ATG TTC CAG CCT AGT GCG 768
Phe Lys Pro Val Met Ser Lys Val Met Glu Met Phe Gln Pro Ser Ala
245 250 255

GTG GTC TTA CAG TGT GGC TCA GAC TCC CTA TCT GGG GAT CGG TTA GGT 816
Val Val Leu Gln Cys Gly Ser Asp Ser Leu Ser Gly Asp Arg Leu Gly
260 265 270

TGC TTC AAT CTA ACT ATC AAA GGA CAC GCC AAG TGT GTG GAA TTT GTC 864
Cys Phe Asn Leu Thr Ile Lys Gly His Ala Lys Cys Val Glu Phe Val
275 280 285

AAG AGC TTT AAC CTG CCT ATG CTG ATG CTG GGA GGC GGT GGT TAC ACC 912
Lys Ser Phe Asn Leu Pro Met Leu Met Leu Gly Gly Gly Gly Tyr Thr
290 295 300

ATT CGT AAC GTT GCC CGG TGC TGG ACA TAT GAG ACA GCT GTG GCC CTG 960
Ile Arg Asn Val Ala Arg Cys Trp Thr Tyr Glu Thr Ala Val Ala Leu
305 310 315 320

GAT ACG GAG ATC CCT AAT GAG CTT CCA TAC AAT GAC TAC TTT GAA TAC 1008
Asp Thr Glu Ile Pro Asn Glu Leu Pro Tyr Asn Asp Tyr Phe Glu Tyr
325 330 335

TTT GGA CCA GAT TTC AAG CTC CAC ATC AGT CCT TCC AAT ATG ACT AAC 1056
Phe Gly Pro Asp Phe Lys Leu His Ile Ser Pro Ser Asn Met Thr Asn
340 345 350

CAG AAC ACG AAT GAG TAC CTG GAG AAG ATC AAA CAG CGA CTG TTT GAG 1104
Gln Asn Thr Asn Glu Tyr Leu Glu Lys Ile Lys Gln Arg Leu Phe Glu
355 360 365

AAC CTT AGA ATG CTG CCG CAC GCA CCT GGG GTC CAA ATG CAG GCG ATT 1152
Asn Leu Arg Met Leu Pro His Ala Pro Gly Val Gln Met Gln Ala Ile
370 375 380

CCT GAG GAC GCC ATC CCT GAG GAG AGT GGC GAT GAG GAC GAA GAC GAC 1200
Pro Glu Asp Ala Ile Pro Glu Glu Ser Gly Asp Glu Asp Glu Asp Asp
385 390 395 400

CCT GAC AAG CGC ATC TCG ATC TGC TCC TCT GAC AAA CGA ATT GCC TGT 1248
Pro Asp Lys Arg Ile Ser Ile Cys Ser Ser Asp Lys Arg Ile Ala Cys
405 410 415

GAG GAA GAG TTC TCC GAT TCT GAA GAG GAG GGA GAG GGG GGC CGC AAG 1296
Glu Glu Glu Phe Ser Asp Ser Glu Glu Glu Gly Glu Gly Gly Arg Lys
420 425 430

AAC TCT TCC AAC TTC AAA AAA GCC AAG AGA GTC AAA ACA GAG GAT GAA 1344
Asn Ser Ser Asn Phe Lys Lys Ala Lys Arg Val Lys Thr Glu Asp Glu
435 440 445

AAA GAG AAA GAC CCA GAG GAG AAG AAA GAA GTC ACC GAA GAG GAG AAA 1392
Lys Glu Lys Asp Pro Glu Glu Lys Lys Glu Val Thr Glu Glu Glu Lys
450 455 460

ACC AAG GAG GAG AAG CCA GAA GCC AAA GGG GTC AAG GAG GAG GTC AAG 1440
Thr Lys Glu Glu Lys Pro Glu Ala Lys Gly Val Lys Glu Glu Val Lys
465 470 475 480

TTG GCC TGA 1449
Leu Ala
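
The paired nucleotide and amino acid rows of SEQ ID NO:1 follow the standard genetic code, so any row of the listing can be cross-checked mechanically. The sketch below is offered only as an illustration; the helper name `translate` is an assumption of this example, and the two test rows are the first and last rows of the listing above.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "TCAG"
ONE_LETTER = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), ONE_LETTER)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(codon_row: str) -> str:
    """Translate a whitespace-separated row of codons into three-letter
    residue names, stopping at the first stop codon."""
    residues = []
    for codon in codon_row.upper().split():
        if len(codon) != 3 or codon not in CODON_TABLE:
            raise ValueError(f"not a valid codon: {codon!r}")
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(THREE_LETTER[aa])
    return " ".join(residues)


first_row = "ATG GCG CAG ACG CAG GGC ACC CGG AGG AAA GTC TGT TAC TAC TAC GAC"
last_row = "TTG GCC TGA"

print(translate(first_row))
print(translate(last_row))
```

A row whose translation disagrees with the printed residues points to a transcription or scanning error in one of the two lines.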



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 379 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

ATTGACTTCC TGCAGAGAGT CAGCCCCACC AATATGCAAG GCTTCACCAA GAGTCTTAAT 60
GCCTTCAACG TAGGCGATGA CTGCCCAGTG TTTCCCGGGC TCTTTGAGTT CTGCTCGCGT 120
TACACAGGCG CATCTCTGCA AGGAGCAACC CAGCTGAACA ACAAGATCTG TGATATTGCC 180
ATTAACTGGG CTGGTGGTCT GCACCATGCC TAGAAGTTTG AGGCCTCTGG CTTCTGCTAT 240
GTCAACGACA TTGTGTTTGG CATCCTGGAG CTGCTCAAGT ACCACCCTCG GGTGCTCTAC 300
ATTGACATTG ACATCCACCA TGGTGACGGG GTTCAAGAAG CTTTCTACCT CACTGACCGG 360
GTCATGACGG TGTCCTTTC 379

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 375 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

TACTACTGTC TGAACGTGCC CCTGCGGATG GGCATTGATG ACCAGAGTTA CAAGCACCTT 60
TTCCAGCCGG TTATCAACCA GGTAGTGGAC TTCTACCAAC CCACGTGCAT TGTGCTCCAG 120
TGTGGAGCTG ACTCTCTGGG CTGTGATCGA TTGGGCTGCT TTAACCTCAG CATCCGAGGG 180
CATGGGGAAT GCGTTGAATA TGTCAAGAGC TTCAATATCC CTCTACTCGT GCTGGGTGGT 240
GGTGGTTATA CTGTCCGAAA TGTTGCCCCC TGCTGGACAT ATGAGACATC GCTGCTGGTA 300
GAAGAGGCCA TTAGTGAGGA GCTTCCCTAT AGTGAATACT TCGAGTACTT TGCCCCAGAC 360
TTCACACTTC ATCCA 375

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 227 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

GGTCATGCTA AATGTGTAGA AGTTGTAAAA ACTTTTAACT TACCATTACT GATGCTTGGA 60
GGAGGTGGCT ACACAATCCG TAATGTTGCT CGATGTTGGA CATATGAGAC TGCAGTTGCC 120
CTTGATTGTG AGATTCCCAA TGAGTTGCCA TATAATGATT ACTTTGAGTA TTTTGGACCA 180
GACTTCAAAC TGCATATTAG TCCTTCAAAC ATGACAAACC AGAACAC 227

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 482 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Met Ala Gln Thr Gln Gly Thr Arg Arg Lys Val Cys Tyr Tyr Tyr Asp
1 5 10 15

Gly Asp Val Gly Asn Tyr Tyr Tyr Gly Gln Gly His Pro Met Lys Pro
20 25 30

His Arg Ile Arg Met Thr His Asn Leu Leu Leu Asn Tyr Gly Leu Tyr
35 40 45

Arg Lys Met Glu Ile Tyr Arg Pro His Lys Ala Asn Ala Glu Glu Met
50 55 60

Thr Lys Tyr His Ser Asp Asp Tyr Ile Lys Phe Leu Arg Ser Ile Arg
65 70 75 80

Pro Asp Asn Met Ser Glu Tyr Ser Lys Gln Met Gln Arg Phe Asn Val
85 90 95

Gly Glu Asp Cys Pro Val Phe Asp Gly Leu Phe Glu Phe Cys Gln Leu
100 105 110

Ser Thr Gly Gly Ser Val Ala Ser Ala Val Lys Leu Asn Lys Gln Gln
115 120 125

Thr Asp Ile Ala Val Asn Trp Ala Gly Gly Leu His His Ala Lys Lys
130 135 140

Ser Glu Ala Ser Gly Phe Cys Tyr Val Asn Asp Ile Val Leu Ala Ile
145 150 155 160

Leu Glu Leu Leu Lys Tyr His Gln Arg Val Leu Tyr Ile Asp Ile Asp
165 170 175

Ile His His Gly Asp Gly Val Glu Glu Ala Phe Tyr Thr Thr Asp Arg
180 185 190

Val Met Thr Val Ser Phe His Lys Tyr Gly Glu Tyr Phe Pro Gly Thr
195 200 205

Gly Asp Leu Arg Asp Ile Gly Ala Gly Lys Gly Lys Tyr Tyr Ala Val
210 215 220

Asn Tyr Pro Leu Arg Asp Gly Ile Asp Asp Glu Ser Tyr Glu Ala Ile
225 230 235 240

Phe Lys Pro Val Met Ser Lys Val Met Glu Met Phe Gln Pro Ser Ala
245 250 255

Val Val Leu Gln Cys Gly Ser Asp Ser Leu Ser Gly Asp Arg Leu Gly
260 265 270

Cys Phe Asn Leu Thr Ile Lys Gly His Ala Lys Cys Val Glu Phe Val
275 280 285

Lys Ser Phe Asn Leu Pro Met Leu Met Leu Gly Gly Gly Gly Tyr Thr
290 295 300

Ile Arg Asn Val Ala Arg Cys Trp Thr Tyr Glu Thr Ala Val Ala Leu
305 310 315 320

Asp Thr Glu Ile Pro Asn Glu Leu Pro Tyr Asn Asp Tyr Phe Glu Tyr
325 330 335

Phe Gly Pro Asp Phe Lys Leu His Ile Ser Pro Ser Asn Met Thr Asn
340 345 350

Gln Asn Thr Asn Glu Tyr Leu Glu Lys Ile Lys Gln Arg Leu Phe Glu
355 360 365

Asn Leu Arg Met Leu Pro His Ala Pro Gly Val Gln Met Gln Ala Ile
370 375 380

Pro Glu Asp Ala Ile Pro Glu Glu Ser Gly Asp Glu Asp Glu Asp Asp
385 390 395 400

Pro Asp Lys Arg Ile Ser Ile Cys Ser Ser Asp Lys Arg Ile Ala Cys
405 410 415

Glu Glu Glu Phe Ser Asp Ser Glu Glu Glu Gly Glu Gly Gly Arg Lys
420 425 430

Asn Ser Ser Asn Phe Lys Lys Ala Lys Arg Val Lys Thr Glu Asp Glu
435 440 445

Lys Glu Lys Asp Pro Glu Glu Lys Lys Glu Val Thr Glu Glu Glu Lys
450 455 460

Thr Lys Glu Glu Lys Pro Glu Ala Lys Gly Val Lys Glu Glu Val Lys
465 470 475 480

Leu Ala

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 133 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Ile Asp Phe Leu Gln Arg Val Ser Pro Thr Asn Met Gln Gly Phe Thr
1 5 10 15

Lys Ser Leu Asn Ala Phe Asn Val Gly Asp Asp Cys Pro Val Phe Pro
20 25 30

Gly Leu Phe Glu Phe Cys Ser Arg Tyr Thr Gly Ala Ser Leu Gln Gly
35 40 45

Ala Thr Gln Leu Asn Asn Lys Ile Cys Asp Ile Ala Ile Asn Trp Ala
50 55 60

Gly Gly Leu His His Ala Lys Lys Phe Glu Ala Ser Gly Phe Cys Tyr
65 70 75 80

Val Asn Asp Ile Val Phe Gly Ile Leu Glu Leu Leu Lys Tyr His Pro
85 90 95

Arg Val Leu Tyr Ile Asp Ile Asp Ile His His Gly Asp Gly Val Gln
100 105 110

Glu Ala Phe Tyr Leu Thr Asp Arg Val Met Thr Val Ser Phe Pro Gln
115 120 125

Ile Arg Glu Ile Tyr
130

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 125 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Tyr Tyr Cys Leu Asn Val Pro Leu Arg Met Gly Ile Asp Asp Gln Ser
1 5 10 15

Tyr Lys His Leu Phe Gln Pro Val Ile Asn Gln Val Val Asp Phe Tyr
20 25 30

Gln Pro Thr Cys Ile Val Leu Gln Cys Gly Ala Asp Ser Leu Gly Cys
35 40 45

Asp Arg Leu Gly Cys Phe Asn Leu Ser Ile Arg Gly His Gly Glu Cys
50 55 60

Val Glu Tyr Val Lys Ser Phe Asn Ile Pro Leu Leu Val Leu Gly Gly
65 70 75 80

Gly Gly Tyr Thr Val Arg Asn Val Ala Arg Cys Trp Thr Tyr Glu Thr
85 90 95

Ser Leu Leu Val Glu Glu Ala Ile Ser Glu Glu Leu Pro Tyr Ser Glu
100 105 110

Tyr Phe Glu Tyr Phe Ala Pro Asp Phe Thr Leu His Pro
115 120 125

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 80 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Asn Leu Leu Val Leu Gly His Ala Lys Cys Val Glu Val Val Lys Thr
1 5 10 15

Phe Asn Leu Pro Leu Leu Met Leu Gly Gly Gly Gly Tyr Thr Ile Arg
20 25 30

Asn Val Ala Arg Cys Trp Thr Tyr Glu Thr Ala Val Ala Leu Asp Cys
35 40 45

Glu Ile Pro Asn Glu Leu Pro Tyr Asn Asp Tyr Phe Glu Tyr Phe Gly
50 55 60

Pro Asp Phe Lys Leu His Ile Ser Pro Ser Asn Met Thr Asn Gln Asn
65 70 75 80

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1275 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..1275

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

ATG GCC GAC AAG GAA GCA GCC TTC GAC GAC GCA GTG GAA GAA CGA GTG 48
Met Ala Asp Lys Glu Ala Ala Phe Asp Asp Ala Val Glu Glu Arg Val
1 5 10 15

ATC AAC GAG GAA TAC AAA ATA TGG AAA AAG AAC ACC CCT TTT CTT TAT 96
Ile Asn Glu Glu Tyr Lys Ile Trp Lys Lys Asn Thr Pro Phe Leu Tyr
20 25 30

GAT TTG GTG ATG ACC CAT GCT CTG GAG TGG CCC AGC CTA ACT GCC CAG 144
Asp Leu Val Met Thr His Ala Leu Glu Trp Pro Ser Leu Thr Ala Gln
35 40 45

TGG CTT CCA GAT GTA ACC AGA CCA GAA GGG AAA GAT TTC AGC ATT CAT 192
Trp Leu Pro Asp Val Thr Arg Pro Glu Gly Lys Asp Phe Ser Ile His
50 55 60

CGA CTT GTC CTG GGG ACA CAC ACA TCG GAT GAA CAA AAC CAT CTT GTT 240
Arg Leu Val Leu Gly Thr His Thr Ser Asp Glu Gln Asn His Leu Val
65 70 75 80

ATA GCC AGT GTG CAG CTC CCT AAT GAT GAT GCT CAG TTT GAT GCG TCA 288
Ile Ala Ser Val Gln Leu Pro Asn Asp Asp Ala Gln Phe Asp Ala Ser
85 90 95

CAC TAC GAC AGT GAG AAA GGA GAA TTT GGA GGT TTT GGT TCA GTT AGT 336
His Tyr Asp Ser Glu Lys Gly Glu Phe Gly Gly Phe Gly Ser Val Ser
100 105 110

GGA AAA ATT GAA ATA GAA ATC AAG ATC AAC CAT GAA GGA GAA GTA AAC 384
Gly Lys Ile Glu Ile Glu Ile Lys Ile Asn His Glu Gly Glu Val Asn
115 120 125

AGG GCC CGT TAT ATG CCC CAG AAC CCT TGT ATC ATC GCA ACA AAG ACT 432
Arg Ala Arg Tyr Met Pro Gln Asn Pro Cys Ile Ile Ala Thr Lys Thr
130 135 140

CCT TCC AGT GAT GTT CTT GTC TTT GAC TAT ACA AAA CAT CCT TCT AAA 480
Pro Ser Ser Asp Val Leu Val Phe Asp Tyr Thr Lys His Pro Ser Lys
145 150 155 160

CCA GAT CCT TCT GGA GAG TGC AAC CCA GAC TTG CGT CTC CGT GGA CAT 528
Pro Asp Pro Ser Gly Glu Cys Asn Pro Asp Leu Arg Leu Arg Gly His
165 170 175

CAG AAG GAA GGC TAT GGG CTT TCT TGG AAC CCA AAT CTC AGT GGG CAC 576
Gln Lys Glu Gly Tyr Gly Leu Ser Trp Asn Pro Asn Leu Ser Gly His
180 185 190

TTA CTT AGT GCT TCA GAT GAC CAT ACC ATC TGC CTG TGG GAC ATC AGT 624
Leu Leu Ser Ala Ser Asp Asp His Thr Ile Cys Leu Trp Asp Ile Ser
195 200 205

GCC GTT CCA AAG GAG GGA AAA GTG GTA GAT GCG AAG ACC ATC TTT ACA 672
Ala Val Pro Lys Glu Gly Lys Val Val Asp Ala Lys Thr Ile Phe Thr
210 215 220

GGG CAT ACG GCA GTA GTA GAA GAT GTT TCC TGG CAT CTA CTC CAT GAG 720
Gly His Thr Ala Val Val Glu Asp Val Ser Trp His Leu Leu His Glu
225 230 235 240

TCT CTG TTT GGG TCA GTT GCT GAT GAT CAG AAA CTT ATG ATT TGG GAT 768
Ser Leu Phe Gly Ser Val Ala Asp Asp Gln Lys Leu Met Ile Trp Asp
245 250 255

ACT CGT TCA AAC AAT ACT TCC AAA CCA AGC CAC TCA GTT GAT GCT CAC 816
Thr Arg Ser Asn Asn Thr Ser Lys Pro Ser His Ser Val Asp Ala His
260 265 270

ACT GCT GAA GTG AAC TGC CTT TCT TTC AAT CCT TAT AGT GAG TTC ATT 864
Thr Ala Glu Val Asn Cys Leu Ser Phe Asn Pro Tyr Ser Glu Phe Ile
275 280 285

GTT GCC ACA GGA TCA GCT GAC AAG ACT GTT GCC TTG TGG GAT CTG AGA 912
Leu Ala Thr Gly Ser Ala Asp Lys Thr Val Ala Leu Trp Asp Leu Arg
290 295 300

AAT CTG AAA CTT AAG TTG CAT TCC TTT GAG TCA CAT AAG GAT GAA ATA 960
Asn Leu Lys Leu Lys Leu His Ser Phe Glu Ser His Lys Asp Glu Ile
305 310 315 320

TTC CAG GTT CAG TGG TCA CCT CAC AAT GAG ACT ATT TTA GCT TCC AGT 1008
Phe Gln Val Gln Trp Ser Pro His Asn Glu Thr Ile Leu Ala Ser Ser
325 330 335

GGT ACT GAT CGC AGA CTG AAT GTC TGG GAT TTA AGT AAA ATT GGA GAG 1056
Gly Thr Asp Arg Arg Leu Asn Val Trp Asp Leu Ser Lys Ile Gly Glu
340 345 350

GAA CAA TCC CCA GAA GAT GCA GAA GAC GGG CCA CCA GAG TTG TTG TTT 1104
Glu Gln Ser Pro Glu Asp Ala Glu Asp Gly Pro Pro Glu Leu Leu Phe
355 360 365

ATT CAT CGT GGT CAT ACT GCC AAG ATA TCT GAT TTC TCC TGG AAT CCC 1152
Ile His Gly Gly His Thr Ala Lys Ile Ser Asp Phe Ser Trp Asn Pro
370 375 380

AAT GAA CCT TGG GTG ATT TGT TCT GTA TCA GAA GAC AAT ATC ATG CAA 1200
Asn Glu Pro Trp Val Ile Cys Ser Val Ser Glu Asp Asn Ile Met Gln
385 390 395 400

GTG TGG CAA ATG GCA GAG AAC ATT TAT AAT GAT GAA GAC CCT GAA GGA 1248
Val Trp Gln Met Ala Glu Asn Ile Tyr Asn Asp Glu Asp Pro Glu Gly
405 410 415

AGC GTG GAT CCA GAA GGA CAA GGG TCC TAG 1278
Ser Val Asp Pro Glu Gly Gln Gly Ser
420 425

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 69 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Asp Xaa Xaa Xaa Asn Xaa Xaa Gly Gly Leu His His Ala Lys Lys Xaa
1 5 10 15

Glu Ala Ser Gly Phe Cys Tyr Xaa Asn Asp Ile Val Xaa Xaa Ile Xaa
20 25 30

Glu Leu Leu Xaa Tyr His Xaa Arg Val Xaa Tyr Ile Asp Xaa Asp Xaa
35 40 45

His His Gly Asp Gly Xaa Glu Glu Ala Phe Tyr Xaa Thr Asp Arg Val
50 55 60

Met Thr Xaa Ser Phe
65

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 69 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Asp Ile Ala Xaa Asn Trp Ala Gly Gly Leu His His Ala Lys Lys Xaa
1 5 10 15

Glu Ala Ser Gly Phe Cys Tyr Val Asn Asp Ile Val Xaa Xaa Ile Leu
20 25 30

Glu Leu Leu Lys Tyr His Xaa Arg Val Leu Tyr Ile Asp Ile Asp Ile
35 40 45

His His Gly Asp Gly Xaa Glu Glu Ala Phe Tyr Xaa Thr Asp Arg Val
50 55 60

Met Thr Val Ser Phe
65

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Cys Val Xaa Xaa Xaa Lys Xaa Phe Xaa Xaa Pro Xaa Xaa Xaa Xaa Gly
1 5 10 15

Gly Gly Gly Tyr Thr Xaa Arg Asn Val Ala Arg Xaa Trp Xaa Xaa Glu
20 25 30

Thr

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Cys Val Glu Xaa Val Lys Xaa Phe Asn Xaa Pro Leu Leu Xaa Leu Gly
1 5 10 15

Gly Gly Gly Tyr Thr Xaa Arg Asn Val Ala Arg Cys Trp Thr Tyr Glu
20 25 30

Thr
What is claimed is:

1. An isolated or recombinant HDx polypeptide.

2. The polypeptide of claim 1, of mammalian origin.

3. The polypeptide of claim 3, of human origin.

4. The polypeptide of claim 1, which polypeptide comprises an HDx polypeptide
sequence at least 88 percent homologous with SEQ ID No: 2, or fragment thereof.

5. The polypeptide of claim 1, which polypeptide comprises an HDx polypeptide
sequence at least 95 percent homologous with SEQ ID No: 2, or fragment thereof.

6. The polypeptide of claim 1, which polypeptide comprises an HDx polypeptide
sequence designated in SEQ ID No: 2.

7. The polypeptide of claim 1, which polypeptide is encoded by a nucleic acid having a
coding sequence, or portion thereof, which hybridizes under stringent conditions to
the nucleic acid designated in SEQ ID No. 1.

9. The polypeptide of claim 1, which polypeptide is an acetylase activity.

10. The polypeptide of claim 1, which polypeptide binds to an RbAp48 protein.

11. The polypeptide of claim 1, which polypeptide is a fusion protein.

12. The polypeptide of claim 1, which polypeptide has a molecular weight in the range
of 45-70 Kd.

13. An isolated or recombinant polypeptide comprising an HDx polypeptide sequence
homologous or identical to SEQ ID No. 2, or a fragment thereof which retains one or
more of (i) a histone deacetylase activity, (ii) a histone binding activity and (iii) an
RbAp48 binding activity.

14. The polypeptide of claim 13, which polypeptide comprises an HDx sequence
represented in the general formula

DXXNXGGLHHAKKXEASGFCYXNDIVXXI-
XELLXYHXRVXYIDXDXHHGDGXEAFYXTDRVMTXSF.

15. The polypeptide of claim 13, which polypeptide comprises an HDx sequence
represented in the general formula
CVXXXKXFXXPXXXXGGGGYTXRNVARXWXXET.

16. The polypeptide of claim 13, which polypeptide deacetylates acetylated histones.

17. The polypeptide of claim 13, which polypeptide is a dominant negative inhibitor
which antagonizes deacetylation of acetylated histones.

18. The polypeptide of claim 13, which polypeptide comprises an HDx sequence at
least 88 percent homologous with SEQ ID No: 2, or fragment thereof.

19. The polypeptide of claim 13, which polypeptide comprises an HDx sequence at
least 95 percent homologous with SEQ ID No: 2, or fragment thereof.

20. The polypeptide of claim 13, which polypeptide includes at least 25 amino acid
residues of an HDx polypeptide sequence.

21. The polypeptide of claim 13, wherein said polypeptide modulates cellular
proliferation.

22. An isolated or recombinant polypeptide comprising an HDx polypeptide sequence
represented in SEQ ID No. 2, or a fragment thereof which retains one or more of (i) a
histone deacetylase activity, (ii) a histone binding activity and (iii) an RbAp48
binding activity.

23. The polypeptide of claim 22, wherein said fusion protein includes, as a second
polypeptide sequence, a polypeptide which functions as a detectable label for
detecting the presence of said fusion protein or as a matrix-binding domain for
immobilizing said fusion protein.

24. The polypeptide of claim 13, wherein said polypeptide is a fusion protein further
comprising, in addition to said HDx sequence, a second polypeptide sequence
having an amino acid sequence unrelated to an HDx polypeptide sequence.

25. A purified or recombinant HDx polypeptide encoded by a nucleic acid which
hybridizes under stringent conditions to a nucleotide sequence designated in SEQ
ID No. 1.

26. A purified or recombinant HDx polypeptide comprising a v motif represented in the
formula
DIAX1NWAGGLHHAKKX2EASGFCYVNDIVX3X4ILELLKYH-
X5RVLYIDIDIHHGDGX6EAFYX7TDRVMTVSF and a x motif represented in
CVEX1VKX2FNX3P-X4LX5LGGGGYTX6RNVARCWTYET.

27. An isolated nucleic acid which encodes a deacetylase activity and hybridizes under
stringent conditions to a nucleotide sequence designated in SEQ ID No. 1.

28. An isolated nucleic acid encoding an HDx polypeptide, which polypeptide
specifically modulates histone acetylation.

29. The nucleic acid of claim 28, which HDx polypeptide comprises a v motif
represented in the general formula
DIAX1NWAGGLHHAKKX2EASGFCYVNDIVX3X4ILELL-
KYHX5RVLYIDIDIHHGDGX6EAFYX7TDRVMTVSF and a x motif
represented in CVEX1VKX2FNX3P-X4LX5LGGGGYTX6RNVARCWTYET.

30. The nucleic acid of claim 28, which HDx polypeptide comprises a polypeptide 
sequence at least 88 percent homologous with SEQ ID No: 2, or fragment thereof 

10 31. The nucleic acid of claim 28, which HDx polypeptide comprises a polypeptide 
sequence at least 95 percent homologous with SEQ ID No: 2, or fragment thereof 

32. The nucleic acid of claim 28, which HDx polypeptide comprises a polypeptide 
sequence designated in SEQ ID No: 2. 

33. The nucleic acid of claim 28, which HDx polypeptide has a molecular weight in the 
15 range of 45-70 Kd. 

34. The nucleic acid of claim 28, which HDx polypeptide is a fusion protein further 
comprising, in addition to HDx polypeptide sequences, a second polypeptide 
sequence having an amino acid sequence unrelated to a nucleic acid sequence. 

35. The nucleic acid of claim 34, wherein said fusion protein includes, as a second 
20 polypeptide sequence, a polypeptide which functions as a detectable label for 

detecting the presence of said fusion protein or as a matrix-binding domain for 
immobilizing said fusion protein. 

36. The nucleic acid of claim 28, further comprising a transcriptional regulatory
sequence operably linked to said nucleotide sequence so as to render said nucleic
acid suitable for use as an expression vector.

37. An expression vector, capable of replicating in at least one of a prokaryotic cell and 
eukaryotic cell, comprising the nucleic acid of claim 36. 

38. A host cell transfected with the expression vector of claim 37 and expressing said 
recombinant polypeptide. 

39. A method of producing a recombinant HDx polypeptide comprising culturing the
cell of claim 38 in a cell culture medium to express said recombinant polypeptide 
and isolating said recombinant polypeptide from said cell culture. 
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40. A transgenic non-human animal having cells which harbor a heterologous transgene 
encoding an HDx polypeptide. 

41. A transgenic non-human animal having cells in which an HDx gene is disrupted. 

42. A recombinant transfection system, comprising

(i) a gene construct including the nucleic acid of claim 28 and operably linked to a
transcriptional regulatory sequence for causing expression of said HDx
polypeptide in eukaryotic cells, and

(ii) a gene delivery composition for delivering said gene construct to a cell and
causing the cell to be transfected with said gene construct.

43. The recombinant transfection system of claim 42, wherein the gene delivery
composition is selected from a group consisting of a recombinant viral particle, a
liposome, and a poly-cationic nucleic acid binding agent.

44. A nucleic acid composition comprising a substantially purified oligonucleotide, said
oligonucleotide including a region of nucleotide sequence which hybridizes under
stringent conditions to at least 25 consecutive nucleotides of sense or antisense
sequence of an HDx gene.

45. The nucleic acid composition of claim 44, which oligonucleotide hybridizes under
stringent conditions to at least 50 consecutive nucleotides of sense or antisense
sequence of an HDx gene.

46. The nucleic acid composition of claim 44, wherein said oligonucleotide further
comprises a label group attached thereto and able to be detected.

47. The nucleic acid composition of claim 44, wherein said oligonucleotide has at least 
one non-hydrolyzable bond between two adjacent nucleotide subunits. 

48. A test kit for detecting cells which contain an HDx-encoding nucleic acid,
comprising the nucleic acid composition of claim 44 for measuring, in a sample of
cells, a level of nucleic acid encoding an HDx protein.

49. A method for modulating one or more of growth, differentiation, or survival of a
mammalian cell responsive to HDx-mediated histone deacetylation, comprising
treating the cell with an effective amount of an agent which modulates the
deacetylase activity of an HDx polypeptide thereby altering, relative to the cell in
the absence of the agent, at least one of (i) rate of growth, (ii) differentiation, or
(iii) survival of the cell.
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50. An antibody to an HDx polypeptide. 

51. The antibody of claim 50, wherein said antibody is monoclonal.

52. A diagnostic assay for identifying a cell or cells at risk for a disorder characterized
by unwanted cell proliferation or differentiation, comprising detecting, in a cell
sample, the presence or absence of a genetic lesion characterized by at least one of
(i) aberrant modification or mutation of a gene encoding an HDx protein, and (ii)
mis-expression of said gene; wherein a wild-type form of said gene encodes an HDx
protein characterized by an ability to modulate the signal transduction activity of a
TGFβ receptor.

53. The assay of claim 52, wherein detecting said lesion includes:

i. providing a diagnostic probe comprising a nucleic acid including a region of
nucleotide sequence which hybridizes to a sense or antisense sequence of said
gene, or naturally occurring mutants thereof, or 5' or 3' flanking sequences
naturally associated with said gene;

ii. combining said probe with nucleic acid of said cell sample; and

iii. detecting, by hybridization of said probe to said cellular nucleic acid, the
existence of at least one of a deletion of one or more nucleotides from said
gene, an addition of one or more nucleotides to said gene, a substitution of
one or more nucleotides of said gene, a gross chromosomal rearrangement of
all or a portion of said gene, a gross alteration in the level of an mRNA
transcript of said gene, or a non-wild type splicing pattern of an mRNA
transcript of said gene.

54. The assay of claim 53, wherein hybridization of said probe further comprises
subjecting the probe and cellular nucleic acid to a polymerase chain reaction (PCR)
and detecting abnormalities in an amplified product.

55. The assay of claim 53, wherein hybridization of said probe further comprises
subjecting the probe and cellular nucleic acid to a ligation chain reaction (LCR) and
detecting abnormalities in an amplified product.

56. The assay of claim 53, wherein said probe hybridizes under stringent conditions to
the nucleic acid designated by SEQ ID No. 1.

57. An assay for screening test compounds to identify agents which inhibit the 
deacetylation of histones comprising: 
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i. providing a reaction mixture including a histone deacetylase activity of an 
HDx-like polypeptide, a substrate for a histone deacetylase, and a test
compound; and
ii. detecting the conversion of the substrate to product,
wherein a statistically significant decrease in the conversion of the substrate in the
presence of the test compound is indicative of a potential inhibitor of histone
deacetylation.

58. The assay of claim 57, wherein the HDx-like polypeptide is of mammalian origin.

59. The assay of claim 57, wherein the HDx-like polypeptide is an RPD3-like
deacetylase of fungal origin.

60. The assay of claim 57, wherein the reaction mixture is a reconstituted protein 
mixture. 

61. The assay of claim 57, wherein said reaction mixture is a cell lysate. 

62. The assay of claim 57, wherein the HDx-like polypeptide is a recombinant protein.

63. An assay for screening test compounds to identify agents which inhibit histone
deacetylase interaction with cellular proteins, comprising:

i. providing a reaction mixture including an HDx-like protein, an HDx-binding
protein, and a test compound; and

ii. detecting the interaction of the HDx-like protein and the HDx-binding
protein,

wherein a statistically significant decrease in the interaction of the proteins in the
presence of the test compound is indicative of a potential inhibitor of a histone
deacetylase.

64. The assay of claim 63, wherein the HDx-like protein is of mammalian origin.

65. The assay of claim 63, wherein the HDx-like polypeptide is an RPD3-like
deacetylase of fungal origin.

66. The assay of claim 63, wherein the HDx-like protein is a histone, or a portion
thereof which interacts with an HDx-like polypeptide.

67. The assay of claim 63, wherein the HDx-like protein is an RbAp48 protein, or a
portion thereof which interacts with an HDx-like polypeptide.

68. The assay of claim 63, wherein the reaction mixture is a reconstituted protein 
mixture. 
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69. The assay of claim 63, wherein said reaction mixture is a cell lysate, 

70. The assay of claim 63, wherein the HDx-like polypeptide is a recombinant protein. 

71. The assay of claim 63, wherein one or both of the HDx-like protein and HDx-binding
protein is a fusion protein.

72. The assay of claim 63, wherein at least one of the HDx-like protein and HDx-binding
protein comprises an endogenous detectable label for detecting the
formation of said complex.

73. The method of claim 63, which reaction mixture is a whole cell, and interaction of
the HDx-like protein and HDx-binding protein is detected in a two hybrid assay
system.

74. A composition for inhibiting a histone deacetylase comprising a compound 
represented by the general formula A-B-C, wherein 

A is selected from the group consisting of cycloalkyls, unsubstituted and
substituted aryls, heterocyclyls, amino acyls, and cyclopeptides;

B is selected from the group consisting of substituted and unsubstituted
C4-C8 alkylidenes, C4-C8 alkenylidenes, C4-C8 alkynylidenes, and -(D-E-F)-, in
which D and F are, independently, absent or represent a C2-C7 alkylidene, a
C2-C7 alkenylidene or a C2-C7 alkynylidene, and E represents O, S, or NR', in which
R' represents H, a lower alkyl, a lower alkenyl, a lower alkynyl, an aralkyl, aryl, or
a heterocyclyl; and

C is selected from the group consisting of
[structural formulae not reproduced], and a boronic
acid; in which Z represents O, S, or NR5, and Y; R5 represents a hydrogen, an
alkyl, an alkoxycarbonyl, an aryloxycarbonyl, an alkylsulfonyl, an arylsulfonyl or
an aryl; R'5 represents hydrogen, an alkyl, an alkenyl, an alkynyl or an aryl; and R7
represents a hydrogen, an alkyl, an aryl, an alkoxy, an aryloxy, an amino, a
hydroxylamino, an alkoxylamino or a halogen; with the proviso that the
compound is not trapoxin.
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75. A pharmaceutical preparation comprising (i) the composition of claim 74 in an 
amount effective for inhibiting proliferation of a cell, and (ii) a pharmaceutically
acceptable diluent. 

76. A method for modulating one or more of growth, differentiation, or survival of a
mammalian cell responsive to HDx-mediated histone deacetylation, comprising
treating the cell with an effective amount of the composition of claim 74 so as to
modulate the deacetylase activity and alter, relative to the cell in the absence of the
agent, at least one of (i) the rate of growth, (ii) the differentiation state, or (iii) the
rate of survival of the cell.

77. A composition for inhibiting a histone deacetylase comprising a compound 
represented by the general formula A-B-C, wherein 

A is selected from the group consisting of cycloalkyls, unsubstituted and
substituted aryls, heterocyclyls, amino acyls, and cyclopeptides;

B is selected from the group consisting of substituted and unsubstituted
C4-C8 alkylidenes, C4-C8 alkenylidenes, C4-C8 alkynylidenes, and -(D-E-F)-, in
which D and F are, independently, absent or represent C2-C7 alkylidenes, C2-C7
alkenylidenes or C2-C7 alkynylidenes, and E represents O, S, or NR', in which R'
represents H, a lower alkyl, a lower alkenyl, a lower alkynyl, an aralkyl, an aryl,
or a heterocyclyl; and

C is selected from the group consisting of
[structural formulae not reproduced], in which R9 represents a hydrogen,
an alkyl, an aryl, a hydroxyl, an alkoxy, an aryloxy or an amino,

with the proviso that the inhibitor compound is not trichostatin.

78. A pharmaceutical preparation comprising (i) the composition of claim 77 in an 
amount effective for inhibiting proliferation of a cell, and (ii) a pharmaceutically
acceptable diluent.
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79. A method for modulating one or more of growth, differentiation, or survival of a 
mammalian cell responsive to HDx-mediated histone deacetylation, comprising
treating the cell with an effective amount of the composition of claim 77 so as to
modulate the deacetylase activity and alter, relative to the cell in the absence of the
agent, at least one of (i) the rate of growth, (ii) the differentiation state, or (iii) the
rate of survival of the cell.

80. A composition for inhibiting a histone deacetylase comprising a compound
represented by the general formula A-B-C, wherein

A is selected from the group consisting of cycloalkyls, unsubstituted and
substituted aryls, heterocyclyls, amino acyls, and cyclopeptides;

B is selected from the group consisting of substituted and unsubstituted
C4-C8 alkylidenes, C4-C8 alkenylidenes, C4-C8 alkynylidenes, and -(D-E-F)-, in
which D and F are, independently, absent or a C2-C7 alkylidene, a C2-C7
alkenylidene, or a C2-C7 alkynylidene, and E represents O, S, or NR', in which R'
is H, lower alkyl, lower alkenyl, lower alkynyl, aralkyl, aryl, or heterocyclyl; and

C represents [structural formula not reproduced]; in which Y is O or S, and R7 represents a
hydrogen, an alkyl, an aryl, an alkoxy, an aryloxy, an amino, a hydroxylamino, an
alkoxylamino or a halogen.

81. A pharmaceutical preparation comprising (i) the composition of claim 80 in an 
amount effective for inhibiting proliferation of a cell, and (ii) a pharmaceutically
acceptable diluent. 



82. A method for modulating one or more of growth, differentiation, or survival of a
mammalian cell responsive to HDx-mediated histone deacetylation, comprising
treating the cell with an effective amount of the composition of claim 80 so as to
modulate the deacetylase activity and alter, relative to the cell in the absence of the
agent, at least one of (i) the rate of growth, (ii) the differentiation state, or (iii) the
rate of survival of the cell.
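
The screening assays of claims 57 and 63 turn on a "statistically significant decrease" in substrate conversion (or protein interaction) in the presence of a test compound. The claims do not name a statistical test or a significance level. The following is a minimal sketch of one possible decision rule, assuming replicate measurements with and without the compound, an exact one-sided permutation test, and a hypothetical threshold of 0.05; the function names and the data are illustrative only.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(control, treated):
    """Exact one-sided permutation test for a decrease in the treated group.

    Returns the fraction of all relabelings of the pooled measurements whose
    (control mean - treated mean) is at least as large as the observed one.
    """
    pooled = list(control) + list(treated)
    n_treated = len(treated)
    n_control = len(pooled) - n_treated
    observed = mean(control) - mean(treated)
    total = sum(pooled)
    as_extreme = 0
    n_relabelings = 0
    for idx in combinations(range(len(pooled)), n_treated):
        treated_sum = sum(pooled[i] for i in idx)
        diff = (total - treated_sum) / n_control - treated_sum / n_treated
        # Small tolerance guards against floating-point round-off.
        if diff >= observed - 1e-12:
            as_extreme += 1
        n_relabelings += 1
    return as_extreme / n_relabelings


def is_potential_inhibitor(control, treated, alpha=0.05):
    """alpha is an assumed significance level, not taken from the claims."""
    return permutation_p_value(control, treated) < alpha


# Hypothetical fractional conversion of substrate to product in replicate wells.
control = [0.92, 0.88, 0.95, 0.90]   # no test compound
treated = [0.41, 0.38, 0.45, 0.40]   # with test compound

p = permutation_p_value(control, treated)
print(round(p, 4), is_potential_inhibitor(control, treated))
```

With four replicates per group there are only 70 possible relabelings, so the smallest attainable p-value is 1/70; more replicates would be needed to support a stricter threshold.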
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Figure lA 
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Figure IB 
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Figure 2A 




K-trap affinity matrix 
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Figure 2B 
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Figure 3B 
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Figure 4 A 
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Figure 4B 
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Figure 4C 
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Figure 4D 
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Sequences 

5> R18769 Hf 3> R21136 4> F07807 

5> R18769 6> D31480 7> R96879 e> N59055 

6 > atggcgcagacgcagggcacccggaggaaagtntgttactactacgacggggatgSSga 

I " t^II*"^'^''^'^*^''°''^°°^=*===AATGAAGCCTCACCGLTCCGCATGACTCAT^^ 

6 > aattactattatggacaaggccacccaatgaagcctcaccgaatccgca?gaS?a^^^ 

150 

!!?"°''^^'^^^*^°*^^^^c^*ccgaaaaatggaaatctatcgccctcacaaagccaat 

6 > l-TGCTGCTCAACTATGGTCrCTACCGAAAAATGGAAATCTATCGNCC^SSJil^^JJ? 

200 

1> 3CTGAGGAGATGACCAAGTACCACAGCGATGACTACATTAAATTCTTGCGCTCCATCCGT 

el NCTGAGGAGATGACCAAGTANCACAGCGATGAC ^^^^^^^^'^^^^^'^^^^^^CAGC 
7> 



TCCTGCAGAGAGTCAGC 



250 



J" ^fJ°*^*^^*^°"^^°°^°^ACAGCAAGCAGATGCAGAGATTCAACGTTGGTGAGGACTGj°° 

2> cccaccaatatgcaaggcttcaccaagagtcttaatgccttcaacgtaggcS?SSg? 

7> CCCACCAATATGCAAGGCTTCACCAAGAGTCTTAATGCCrrCAACGTAGGSJJSS^^ 

i> ccagtattcgatggcctgtttgagttctgtcagttgtctactggtggttctItggcaagt 
2 > ccagtgtttcccgggctctttgagttctgctcgcgttacacaggcgcatSSgcSSgI 

r P^J?:™^^^°°°^^^'^°'^^-^^^'^^^^=^=^ACACAGG^ 

7> ccagtgtttcccgggctctttgagttctgctcgcgttacacaggcgcatctSgc^^^ 



450 



GCAACCCAGCTGAACAACAAGATCTGTGATATTGCCATTAACTGGGCTGGTNGT^^^ 



1 > CATGCAAAGAAGTCCGAGGCATCTGGCTTCTGTTACi 

CATGCCTAGAAGTTTGAGGCCTCrGGCTTCTGCTATGTCAACGACATTGTGTTT^ 



2> 



:gtcaatgatatcgtcttggccatc 
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Figure 5A (conU) 



NATGCCANGANGTTTNAGGCCTCTGGNTTCTGCTATGTCAACGACATTGTGATTGGCATC 
CATGCCAAGAAGTTTGAGGCCTCTGGTTTCTGCTATGTCAACGACATTGTGATTGGCATC 

500 . . 

CTGGAACTGCTAAAGTATCACCAGAGGGtgCTGTACATTGACATTGATATTCACCATGGT 

CTGGAGCTGCTCAAGTACCACCCTCGGGTGCTCTACATTGACATTGACATCCACCATGGT 
CTGGAGCTGCTCAAGTACCACCCTCGGGTGCTCTACATTGACATTGACATCCACCATGGT 
CTGGAGCTGCTCAAGTACCACCCTCGGGTGCTCTACATTGACATTGACATCCACCA 



550 . . . . 60 

GACGGCGTGGAAGAGGcCTTCTACACCACGGACCGGGTCATGACTGTGTCCTTTCATAAG 
GACGGGGTTCAAGAAGCTTTCTACCTCACTGACC 

GACGGGGTTCAAGAAGCTTTCTACCTCACTGACCGGGTCATGACGGTGTCCTTTCCACAA 

CTACACCACGGACCGGGTCATGACTGTGTCCTTTCATAAG 

650 

TATGGAGAGTACTTCCCAGGAACTGGGGACCTACGGGATATCGGGGCTGGCAAAGGCAAG 
ATACGGGAAATTTACTTNTTCCNGGGGCACAGGTGACATGTTNTGGAAGTTCGGGGGGCA 
TATGGAGAGTACTTCCCAGGGACTTGGGACCTACGGGATATCGGGGCTGGCAAAGGCAAG 



700 

TATTATGCTGTTAACTACCCGCTCCGAGACGGGATTGATGACGAGTCCTATGAGGCCATT 
TACTACTGTCTGAACGTGCCCCTGCGGATGGGCATTGATGACCAGAGTTACAAGCACCTT 
GGAGAGTTGGCCC 

TATTATGCTGTTAACTACCCGCTCCGAGACGGGATTNATGACGAGTCCTATGAGGCCATT 

750 

TTCAAGCCGGTCATGTCCAAAGTAATGGAGATGTTCCAGCCTAGTGCGGTGGTCTTACAG 
TTCCAGCCGGTTATCAACCAGGTAGTGGACTTCTACCAACCCACGTGCATTGTGCTCCAG 

CCCTATAGTGAGTCGTATTNN 
TTCAAGCCGGTCATGTCCAAAGTAATNGAGATGTTCCAGCCTAGTGCG 

800 , . . 

TGTGGCTCAGACTCCCTATCTGGGGATCGGTTAGGTTGCTTCAATCTAACTATCAAAGGA 
TGTGGAGCTGACTCTCTGGGCTGTGATCGATTGGGCTGCTTTAACCTCAGCATCCGAGGG 
TNAAAACATGACTCACTNGGNTNNNTACGATTGGGCTGCTTTAACCTCAGCATCCGAGGG 

AGGT 

850 - . . . 90 

CACgCCAAGTGTGTGGAATTTGTCAAGAGCTTTAACCTGCCTATGCTGATGCTGGGAGGC 
CATGGGGAATGCGTTGAATATGTCAAGAGCTTCAATATCCCTCTACTCGTGCTGGGTGGT 

GGA 

CATGGGNAATGCGTTGAATATGTCAAGAGCTTCAATATCCCTCTACTCGTGCTGGGTGGT 
NATGCTAAATGTGTAGAAGTTGTAAAAACTTTTAACTTACCATTACTGATGCTTGGAGGA 
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Figure 5A (cont) 



950 

1 > GGTGGTTACACCATTCGTAACGTTGCCCGGTGCTGGACATATGAGACAGCTGTGGCCCTG 

3 > GGTGGTTATACTGTCCGAAATGTTGCCCGCTGCTGGACATATGAGACATCGCTGCTGGTA 

4 > GGTGGCTACACAATCCGTAATGTTGCTCGATGTTGGACATATGAGACTGCAGTTGCCCTT 

8 > GGTGGTTATACTGTCCGAAATGTNGCCCGCTGCTGGACATATGAGACANCGCTGCTGGTA 

9 > GGTGGCTACACAATCCGTAATGTTGCTCGATGTTGGACATATGAGACTGCAGTTGCCCTT 

1000 

1 > GATACGGAGATCCCTAATGAGCTTCCATACaATGACTACTTTGAATACTTTGGACCAGAT 

3 > GAAGAGGCCATTAGTGAGGAGCTTCCCTATAGTGAATACTTCGAGTACTTTGCCCCAGAC 

4 > GATTGTGAGATTCCCAATGAGTTGCCATATAATGATTACTTTGAGTATTTTGGACCAGAC 
8 > GAAGAGGCCATTAGTGAGGAGCTTCCCTAATAGTGAATACTTCGNTACTTTGCCCCAGAC 
9> GATTGTGAGATTCCCAATGGTAAGTGTTCTCATTACAATATCTTTATTGTATG 

1050 

1 > TTCAAGCTCCACATCAGTCCTTCCAATATGACTAACCAGAACACGAATGAGTACC tGGAG 
3> TTCACACT 

4 > TTCAAACTGCATATTAGTCCTTCAAACATGACAAACCAGAACAC 

a > TTCACACTTCATCCANATGTCAGCACCCGCATCGAGAATCCAGAACTCACGCCAGTATC 

1100 

1 > AAGATCAAACAGCGACTGTTTGAGAACCTTAGAATGCTGCCGCACGCACCTGGGGTCCAA 

8 > NGGACCAAGATCCGCCAGACAATCTTTGNAAACCTGAAGGTTCTTNAACC 

1150 - . . , 12 

1> ATGCAGGCGATTCCTGAGGACGCCATCCCTGAGGAGAGTGGCGATGAGGACGAAGACGAC 

1250 

1 > CCTGACAAGCGCATCTCGATCTGCTCCTCTGACAAACGAATTGCCTGTGAGGAAGAGTTC 

. - 1300 

1 > TCCGATTCTGAAGAGGAGGGAGAGGGGGGCCGCAAGAACTCTTCCAACTTCAAAAAAGCC 

1350 

> AAGAGAGTCAAAACAGAGGATGAAAAAGAGAAAGACCCAGAGGAGAAGAAAGAAGTCACC 

1400 

> GAAGAGGAGAAAACCAAGGAGGAGAAGCCAGAAGCCAAAGGGGTCAAGGAGGAGGTCAAG 

> TTGGCCTGA 



F06693 

> H05234 

> R21136 
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Figure 5B 



HDl ( 1) maqtqGTRRKVCYYYDGDVGNYYYGOGHPMKPKRIRMTHNLLLITYGLyRK 

( ij mvyeatpfdpITVKPSDKRRVAYFYDADVGNYAYGAGHPMKPKRIRMAHSLIMfTYGLYKK 

x_rpd3 ( 1) MALT1X3TKKKVCYYYDGDVG^^VTYG0GHPMKPHRIR^^^HNLLLNY 

< Es t A > IDFLQRVS PTNMOGFTKSLNAFNVGDDCPVFPGLFEFC 
^1 ( 51) MEIYRPHKAKAEEMTKYHSDDYIKFLiRSIRPONMSEYSKOMQRFNVGEDCPVFDGt,FEFC 

{ 61) MEIYRAKPATKQEMCOFHTDEYIDFL.SRVTPDNLEMFKRESVKFNVGDDCPVFDGLYEYC 

^_rpti3 { 51) MEIFRPHKASAEDMTKYHSDDYIKFLRSIRPDNMSEYSKQMQRFNVGEDCPVFDGl^FEFC 

<EstA> SRYTGASLQGATQI-NNKICDIAlNWAGGL.HHAKKFEASGFCYVNDIVFGILELLiKYHPRV 

HDl ( 111) 0LSTGGSVASAVKlJ4K0QTDIA\rtWAGGLHHAKKSEASGFCrrVNDIVIJiLlLELL*KYHO 

RPD3 ( 121) SISGGGSMEGAARLNRGKCDVAVNYAGGLHHAKKSEASGFCYLNDIVIXSIIELLRYHPRV 

X_rpci3 { 111) 01^AGGSVASAVKLNKQ0TDISVNWSGGLHHAKKSEASGFCyVNDIVLAILEL.LKYHQRV 

< Es t B > YYCLNVPLRjyj 
<EstA> LYIDIDIHHGDGVOEAFYLTDRVKTVSFPOIREIY 

KOI ( 171) LYIDIDIHHGDGVEEAFYTTDRVMTVSFHKYGEYFPGTGDL,RDIGAGKGKYYAWNYPr>RD 

( 181) LYIDlD\mHGDGVEEAFYTTDRVMTCSFHKYGEFFPGTGELRDrGVGAGKNYAVNVPLRD 

x_rpd3 ( 171) VYIDIDXHHGDGVEEAFYTTDRVMTVSFHKYGEYFPGTGDLRDIGAGKGKYYAVNYALRD 

<EstC> NLLVLGHAKCVEWKT 

<EstB> GIDDQSYKHLFOPVINOWDFYQPTCIVLQCGADSLGCDRLGCFNLSIRGHGECVEYVKS 

HOI ( 231) GIDDESYEAIFKPVMSKVMEMFQPSAVVLOCGSDSLSGDRLGCFNLTIKGHAKCVEFVKS 

RPD3 ( 241) GIDDATYRSVFEPVIKKIMEWYQPSAWLQCGGDSLSGDRLGCFNLSMEGHANCVNYVKS 

^_rpd3 ( 231) GIDDESYEAIFKPVMSKVMEMFQPSAWLOCGADSLSGDRLGCFNLTIKGKAKCVEFIKT 

<5/'4 > FNLPLLMLGGGGYTIRNVARCWTYETAVAI>DCEIPNELPYNDYFEYFGPDFKI,HISPSNM 

< 3 > FNIPLLVt.GGGGYTVRNVARCVfTYETSLLVEEAISEELPYSEYFEYFAPDFTl.HP 

HOI ( 291) FNLPMLMLGGGGYTXRNVARCVrrYETAVALDTEIPNELPYNDYFRYFGPDFKLHISPSNM 

RPD3 ( 3 01) FGIPIWn^GGGGYTMRNVARTWCFETGLLNNVVLDKDLPYNEYYEYYGPDYKLSVRPSNM 

x_rpd3 { 291) FNLPLLMLGGGGYTIRNVARCVrrYETAVALDSEIPNELPYNDYFEYFGPDFKLHISPSNM 

. ;} 

cEstC> TNQN 

<EstB> (351) TNONTNEYLEKIKQRLFENLRMLPHAPGVQMQAIPEDAIPEESGDEDEDDPDKRISICSS 

RP03 ( 3 61) FNVNTPEYLDKVMTNIFANLEMTKYAPSVQLNKTPRDaedlgdveedsaeakdt)cggsqy 

^_rpd3 < 351) TNQNTNEYLEKIKORLFENLRMLPHAPGVOMQAVAEDSIHDDSGEEDEDDPDKRISIRSS 

HOI ( 411) DKRIACEEEFSDSEEEGEGGRKNSSNFKKAKRVKTEDEREkdPEEKKEVTEEEKTKEEKP 

RPD3 ( 421) ardlhvehdnefy-- 

X_rpd3 ( 411) DKRIACDEEFSDSEDEGEGGRKNVANFKKVKRVKTEEEKE- -GEDKKDVKEEEKAKDEKT 

HDl < 4 71) EAKGVKEEVRla 

RPD3 ( 4 34) 

3t_rpd3 ( 46 9) DSKRVKEETKsv 
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Specificity Element A 
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Figure 7 
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Figure 8A 
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Figure 8B 
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Figure 8C 
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Figure 9B 
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Figure 12 
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Figure 13 
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Government Funding 

Work described herein was supported in part by funding from the National
Institute of Health. The United States Government has certain rights in the invention.

Background of the Invention 

The organization of regulatory DNA elements into precise chromatin structures is
important for both DNA replication and transcription in vivo (Lee et al. (1993) Cell
72:73-84; Felsenfeld (1992) Nature 355:219). In eukaryotic cells, nuclear DNA exists as
a hierarchy of chromatin structures, resulting in the compaction of nuclear DNA about
10,000 fold (Davie and Hendzel (1994) J. Cell. Biochem. 55:98). The repeating
structural unit in the extended 10 nm fibre form of chromatin is the nucleosome (van
Holde (1988) Chromatin. New York: Springer-Verlag). The nucleosome consists of 146
bp of DNA wrapped around a protein core of the histones H2A, H2B, H3 and H4,
known as the core histones. These histones are arranged as an (H3-H4)2 tetramer and
two H2A-H2B dimers positioned on each face of the tetramer. The DNA joining the
nucleosomes is called linker DNA; it is to the linker DNA that the H1 or linker
histones bind. The 10 nm fibre is compacted further into the 30 nm fibre. Linker
histones and amino-terminal regions ("tails") of the core histones maintain the higher
order folding of chromatin (Garcia Ramirez et al. (1992) J. Biol. Chem. 267:19587). This
chromatin structure must be relaxed when DNA is transcribed or translated.

Histones of the nucleosome core particle are subject to reversible acetylation at
the e-amino group of lysines present in their amino terminus (Csordas et al. (1990)
Biochem. J. 265:23-38). Transcriptionally silent regions of the genome are enriched in
underacetylated histone H4 (Turner (1993) Cell 75:5-8), and histone hyperacetylation
facilitates the ability of transcription factor TFIIIA to bind to chromatin templates (Lee et
al. (1993) Cell 72:73-84). Recent genetic, biochemical and immunological approaches
have provided substantial evidence indicating that histones associated with actively
transcribed genes are more highly acetylated than those from nontranscribed regions.
While not wishing to be bound by any particular theory, histone acetylation may influence
transcription at several stages, for example, by causing transcription factors to bind or by
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inducing structural transitions in chromatin, or by facilitating histone displacement and 
repositioning during polymerase elongation. 

The acetylation and deacetylation are catalyzed by specific enzymes, histone
acetyltransferase and deacetylase, respectively, and the net level of the acetylation is
controlled by the equilibrium between these enzymes. The steady state level of
acetylation and the rates at which acetate groups are turned over vary both between and
within different cell types, with half-lives that vary from a few minutes to several hours.
Although a histone acetyltransferase gene (HAT1) has been identified in yeast (Kleff et
al. (1995) J. Biol. Chem. 270:24674-24677), the molecular entities responsible for
histone deacetylation were heretofore unknown in the art.
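
The relationship between turnover half-life and the net acetylation level described above can be illustrated with a toy calculation. The sketch below assumes simple first-order kinetics and a two-state (acetylated or not) lysine site; neither assumption comes from the specification, and the rate values are hypothetical.

```python
import math


def rate_constant_from_half_life(half_life_min: float) -> float:
    """First-order rate constant (per minute) for a given half-life in minutes."""
    if half_life_min <= 0:
        raise ValueError("half-life must be positive")
    return math.log(2) / half_life_min


def steady_state_acetylated_fraction(k_acetylation: float, k_deacetylation: float) -> float:
    """Fraction of sites acetylated at steady state in a two-state model.

    Assumed model (not from the specification):
        d[Ac]/dt = k_acetylation * (1 - [Ac]) - k_deacetylation * [Ac]
    which settles at k_acetylation / (k_acetylation + k_deacetylation).
    """
    total = k_acetylation + k_deacetylation
    if total <= 0:
        raise ValueError("at least one rate constant must be positive")
    return k_acetylation / total


# Hypothetical example: acetate groups removed with a 5 minute half-life,
# and acetylation proceeding at one quarter of the deacetylation rate.
k_deac = rate_constant_from_half_life(5.0)
k_ac = 0.25 * k_deac
fraction = steady_state_acetylated_fraction(k_ac, k_deac)
print(f"k_deac = {k_deac:.4f} per min, acetylated fraction = {fraction:.2f}")
```

In this simplified picture, inhibiting the deacetylase (lowering k_deac) raises the steady-state acetylated fraction without any change in acetyltransferase activity.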

The identification of the mechanism by which histones are deacetylated would be 
of great benefit in the control of gene transcription and the cell cycle. 
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Summary of the Invention 

The present invention relates to the discovery of a novel family of genes, and gene 
products, expressed in mammals, which genes are referred to hereinafter as the "histone 
deacetylase" genes or "HDx" gene family, the products of which are referred to as 
histone deacetylases or HDx proteins. 

In general, the invention features isolated HDx polypeptides, preferably
substantially pure preparations of one or more of the subject HDx polypeptides. The
invention also provides recombinantly produced HDx polypeptides. In preferred
embodiments the polypeptide has a biological activity including an ability to deacetylate
an acetylated histone substrate, preferably a substrate analog of histone H3 and/or H4. In
other embodiments the HDx polypeptides of the present invention bind to trapoxin or to
trichostatin, such binding resulting in the inhibition of a deacetylase activity of the HDx
polypeptide. However, HDx polypeptides which specifically antagonize such activities,
such as may be provided by dominant negative mutants, are also specifically
contemplated.

The HDx polypeptides disclosed herein are capable of modulating proliferation,
survival and/or differentiation of cells because of their ability to alter chromatin structure
by deacetylating histones such as H3 or H4. Moreover, in preferred embodiments, the
subject HDx proteins have the ability to modulate cell growth by influencing cell cycle
progression or to modulate gene transcription.



WO 97/35990 PCT/US97/05275

In one embodiment, the polypeptide is identical with or homologous to an HDx
protein. Exemplary HDx polypeptides include amino acid sequences represented in any
one of SEQ ID Nos: 5-8. Related members of the HDx family are also contemplated, for
instance, an HDx polypeptide preferably has an amino acid sequence at least 85%
homologous to a polypeptide represented by one or more of the polypeptides designated
SEQ ID Nos: 5-8, though polypeptides with higher sequence homologies of, for example,
88%, 90% and 95% are also contemplated. In one embodiment, the HDx polypeptide is
encoded by a nucleic acid which hybridizes under stringent conditions with a nucleic acid
sequence represented in one or more of SEQ ID Nos: 1-4. Homologs of the subject HDx
proteins also include versions of the protein which are resistant to post-translation
modification, as for example, due to mutations which alter modification sites (such as
tyrosine, threonine, serine or asparagine residues), or which inactivate an enzymatic
activity associated with the protein.
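
As an illustration of the percentage thresholds recited above, the following sketch computes percent identity over two sequences that have already been aligned. It assumes that "homologous" is scored as identical residues at aligned, ungapped positions; the specification may define homology differently, and the two fragments shown are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of ungapped aligned positions carrying the same residue.

    Both inputs must come from the same alignment (equal length, '-' for gaps).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned positions to compare")
    return 100.0 * matches / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 85.0) -> bool:
    """threshold defaults to the 85% figure recited in the text."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical 20-residue fragments differing at two positions.
frag_a = "LYIDIDIHHGDGVEEAFYTT"
frag_b = "VYIDIDIHHGDGVEEAFYLT"
print(percent_identity(frag_a, frag_b))        # 18 of 20 positions identical
print(meets_threshold(frag_a, frag_b, 95.0))
```

In practice the alignment step itself (and the scoring of conservative substitutions) would be carried out by a dedicated alignment tool; this sketch covers only the final counting step.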

The HDx polypeptide can comprise a full length protein, such as represented in
SEQ ID No. 5, or it can comprise a fragment corresponding to particular motifs/domains,
or to arbitrary sizes, e.g., at least 5, 10, 25, 50, 100, 150 or 200 amino acids in length. In
preferred embodiments, the polypeptide, or fragment thereof, specifically deacetylates
histone H4. In other preferred embodiments, the HDx polypeptide includes both a v
motif (SEQ ID No. 12) and a x motif (SEQ ID No. 14), preferably a v motif represented
in the general formula SEQ ID No. 13, and a x motif represented in the general formula
SEQ ID No. 15.
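
A general formula of the kind recited in the claims for the x motif (CVEX1VKX2FNX3P-X4LX5LGGGGYTX6RNVARCWTYET) can be checked against a candidate sequence with a regular expression. The sketch below assumes that each numbered X stands for exactly one residue of any kind and that the hyphen is a typesetting artifact; the specification may restrict the residues permitted at each X, and the test fragment is illustrative rather than copied from the sequence listing.

```python
import re

# x motif general formula as recited in the claims (X1 to X6 are placeholders).
X_MOTIF_FORMULA = "CVEX1VKX2FNX3P-X4LX5LGGGGYTX6RNVARCWTYET"


def formula_to_regex(formula: str) -> re.Pattern:
    """Turn a motif formula with numbered X placeholders into a compiled regex.

    Assumptions of this sketch: every Xn matches one residue of any kind,
    and hyphens carry no meaning.
    """
    cleaned = formula.replace("-", "")
    # Replace every "X<digits>" placeholder with a single-residue wildcard.
    pattern = re.sub(r"X\d+", "[A-Z]", cleaned)
    return re.compile(pattern)


def has_motif(sequence: str, formula: str) -> bool:
    """True if the sequence contains a stretch matching the formula."""
    return formula_to_regex(formula).search(sequence.upper()) is not None


# Illustrative fragment (hypothetical test input).
fragment = "GHAKCVEFVKSFNLPMLMLGGGGYTIRNVARCWTYETAVAL"
print(has_motif(fragment, X_MOTIF_FORMULA))
```

The same helper could be applied to the v motif formula; a stricter version would replace the wildcard with the residue set permitted at each numbered position.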

In certain preferred embodiments, the invention features a purified or recombinant
HDx polypeptide having a molecular weight in the range of 40kd to 60kd. For instance,
preferred HDx polypeptides have molecular weights in the range of 50kd to about 60kd,
even more preferably in the range of 53-58kd. It will be understood that certain
post-translational modifications, e.g., phosphorylation, prenylation and the like, can increase
the apparent molecular weight of the HDx protein relative to the unmodified polypeptide
chain.

The subject proteins can also be provided as chimeric molecules, such as in the
form of fusion proteins. For instance, the HDx protein can be provided as a recombinant
fusion protein which includes a second polypeptide portion, e.g., a second polypeptide
having an amino acid sequence unrelated (heterologous) to the HDx polypeptide, e.g. the
second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide
portion is an enzymatic activity such as alkaline phosphatase, e.g. the second polypeptide
portion is an epitope tag.
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In yet another embodiment, the invention features a nucleic acid encoding a an 
HDx polypeptide, or polypeptide homologous thereto, which polypeptide has the ability 
to modulate, e.g., either mimic or antagonize, at least a portion of the activity of a
wild-type HDx polypeptide. Exemplary HDx-encoding nucleic acid sequences are represented
by SEQ ID Nos: 1-4.

In another embodiment, the nucleic acid of the present invention includes a
coding sequence which hybridizes under stringent conditions with one or more of the
nucleic acid sequences in SEQ ID Nos: 1-4. The coding sequence of the nucleic acid can
comprise a sequence which is identical to a coding sequence represented in one of SEQ
ID Nos: 1-4, or it can merely be homologous to one or more of those sequences. In
preferred embodiments, the nucleic acid encodes a polypeptide which specifically
modulates, by acting as either an agonist or antagonist, the enzymatic activity of an HDx
polypeptide.

Furthermore, in certain preferred embodiments, the subject HDx nucleic acid will
include a transcriptional regulatory sequence, e.g. at least one of a transcriptional
promoter or transcriptional enhancer sequence, which regulatory sequence is operably
linked to the HDx gene sequence. Such regulatory sequences can be used to render
the HDx gene sequence suitable for use as an expression vector. This invention also
contemplates the cells transfected with said expression vector, whether prokaryotic or
eukaryotic, and a method for producing HDx proteins by employing said expression
vectors.

In yet another embodiment, the nucleic acid hybridizes under stringent conditions
to a nucleic acid probe corresponding to at least 12 consecutive nucleotides of either
sense or antisense sequence of one or more of SEQ ID Nos: 1-4; though preferably to at
least 25 consecutive nucleotides; and more preferably to at least 40, 50 or 75 consecutive
nucleotides of either sense or antisense sequence of one or more of SEQ ID Nos: 1-4.
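
The "at least 12 consecutive nucleotides of either sense or antisense sequence" criterion can be approximated in silico by looking for a shared exact run between a candidate probe and a reference sequence or its reverse complement. This is only a crude proxy: stringent hybridization depends on salt, temperature and tolerated mismatches, none of which is modelled here. The reference sequence below is a hypothetical stand-in, not one of SEQ ID Nos: 1-4.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written with A, C, G, T."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def longest_shared_run(a: str, b: str) -> int:
    """Length of the longest stretch of consecutive identical nucleotides."""
    best = 0
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0] * (len(b) + 1)
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def shares_run(probe: str, reference: str, min_run: int = 12) -> bool:
    """True if the probe shares min_run consecutive nucleotides with the
    sense strand of the reference or with its antisense strand."""
    probe = probe.upper()
    reference = reference.upper()
    sense = longest_shared_run(probe, reference)
    antisense = longest_shared_run(probe, reverse_complement(reference))
    return max(sense, antisense) >= min_run


# Hypothetical reference sequence used only for illustration.
reference = "ATGGCGCAGACGCAGGGCACCCGGAGGAAAGTCTGTTACTACTACGACGGG"
sense_probe = reference[5:20]                          # 15 nt of sense strand
antisense_probe = reverse_complement(reference[10:30])  # 20 nt of antisense strand
print(shares_run(sense_probe, reference), shares_run(antisense_probe, reference))
```

Raising min_run to 25, 40, 50 or 75 corresponds to the progressively preferred probe lengths recited above.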

Yet another aspect of the present invention concerns an immunogen comprising
an HDx polypeptide in an immunogenic preparation, the immunogen being capable of
eliciting an immune response specific for an HDx polypeptide; e.g. a humoral response,
e.g. an antibody response; e.g. a cellular response. In preferred embodiments, the
immunogen comprises an antigenic determinant, e.g. a unique determinant, from a
protein represented by one of SEQ ID Nos: 5-8.

A still further aspect of the present invention features antibodies and antibody 
preparations specifically reactive with an epitope of the HDx immunogen. 
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The invention also features transgenic non-human animals, e.g. mice, rats, rabbits, 
chickens, frogs or pigs, having a transgene, e.g., animals which include (and preferably 
express) a heterologous form of an HDx gene described herein, or which misexpress an 
endogenous HDx gene, e.g., an animal in which expression of one or more of the subject 
HDx proteins is disrupted. Such a transgenic animal can serve as an animal model for 
studying cellular and tissue disorders comprising mutated or mis-expressed HDx alleles 
or for use in drug screening. 

The invention also provides a probe/primer comprising a substantially purified
oligonucleotide, wherein the oligonucleotide comprises a region of nucleotide sequence
which hybridizes under stringent conditions to at least 12 consecutive nucleotides of
sense or antisense sequence of SEQ ID Nos: 1-4, or naturally occurring mutants thereof.
Nucleic acid probes which are specific for each of the HDx proteins are contemplated by
the present invention, e.g. probes which can discern between nucleic acid encoding a
human or bovine HDx. In preferred embodiments, the probe/primer further includes a
label group attached thereto and able to be detected. The label group can be selected,
e.g., from a group consisting of radioisotopes, fluorescent compounds, enzymes, and
enzyme co-factors. Probes of the invention can be used as a part of a diagnostic test kit
for identifying dysfunctions associated with mis-expression of an HDx protein, such as
for detecting, in a sample of cells isolated from a patient, a level of a nucleic acid encoding
a subject HDx protein; e.g. measuring an HDx mRNA level in a cell, or determining
whether a genomic HDx gene has been mutated or deleted. These so-called
"probes/primers" of the invention can also be used as a part of "antisense" therapy, which
refers to administration or in situ generation of oligonucleotide probes or their
derivatives which specifically hybridize (e.g. bind) under cellular conditions, with the
cellular mRNA and/or genomic DNA encoding one or more of the subject HDx proteins
so as to inhibit expression of that protein, e.g. by inhibiting transcription and/or
translation. Preferably, the oligonucleotide is at least 12 nucleotides in length, though
primers of 25, 40, 50, or 75 nucleotides in length are also contemplated.

In yet another aspect, the invention provides an assay for screening test 
compounds for inhibitors, or alternatively, potentiators, of an interaction between an 
HDx protein and an HDx binding protein or nucleic acid sequence. An exemplary 
method includes the steps of (i) combining an HDx polypeptide or fragment thereof, one 
or more HDx target polypeptides (such as a histone, SIN3, RbAp48 or other protein
which participates in HDx complexes, e.g., one or more proteins having molecular
weights of 250 kDa, 180 kDa, 55 kDa, 50 kDa, 42 kDa, 33-36 kDa and 30 kDa, see also
Example 3), and a test compound, e.g., under conditions wherein, but for the test
compound, the HDx protein and target polypeptide(s) are able to interact; and (ii)
detecting the formation of a complex which includes the HDx protein and target
polypeptide(s) by directly quantitating the complex, by measuring the deacetylase activity
of the HDx protein, or by measuring inductive effects of the HDx protein. A statistically
significant change, such as a decrease, in the formation of the complex in the presence of 
a test compound (relative to what is seen in the absence of the test compound) is 
indicative of a modulation, e.g., inhibition, of the interaction between the HDx protein 
and its target polypeptide. 
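
One way to judge whether a change in complex formation is statistically significant is sketched below, assuming replicate signal readings with and without the test compound. The exact permutation test is our choice for illustration, not a method named in this document; the function names and all numbers are hypothetical.

```python
from itertools import combinations
from statistics import mean


def percent_change(control, treated):
    """Percent change in mean complex signal caused by the test compound."""
    return 100.0 * (mean(treated) - mean(control)) / mean(control)


def permutation_p_value(control, treated):
    """Two-sided exact permutation p-value for a difference in group means."""
    pooled = list(control) + list(treated)
    observed = abs(mean(treated) - mean(control))
    hits = total = 0
    # Enumerate every way of relabeling the pooled readings into two groups.
    for idx in combinations(range(len(pooled)), len(control)):
        group_a = [pooled[i] for i in idx]
        group_b = [pooled[i] for i in range(len(pooled)) if i not in idx]
        total += 1
        if abs(mean(group_b) - mean(group_a)) >= observed - 1e-12:
            hits += 1
    return hits / total


# Hypothetical replicate signals (arbitrary units), not data from this document.
no_compound = [100.0, 96.0, 104.0, 99.0]
with_compound = [52.0, 47.0, 55.0, 50.0]
change = percent_change(no_compound, with_compound)
p_value = permutation_p_value(no_compound, with_compound)
```

A negative `change` together with a small `p_value` would be read as inhibition of the interaction; the significance threshold is left to the experimenter.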

Furthermore, the present invention contemplates the use of other homologs of the
HDx polypeptides or bioactive fragments thereof to generate similar assay formats. In
one embodiment, the drug screening assay can be derived with a fungal homolog of an 
HDx protein, such as RPD3, in order to identify agents which inhibit histone 
deacetylation in a yeast cell. 

Yet another aspect of the present invention concerns a method for modulating
one or more of growth, differentiation, or survival of a mammalian cell by modulating
HDx bioactivity, e.g., by inhibiting the deacetylase activity of HDx proteins, or disrupting
certain protein-protein interactions. In general, whether carried out in vivo, in vitro, or
in situ, the method comprises treating the cell with an effective amount of an HDx
therapeutic so as to alter, relative to the cell in the absence of treatment, at least one of
(i) rate of growth, (ii) differentiation, or (iii) survival of the cell. Accordingly, the
method can be carried out with HDx therapeutics such as peptides and peptidomimetics or
other molecules identified in the above-referenced drug screens which antagonize the
effects of a naturally-occurring HDx protein on said cell. Other HDx therapeutics include
antisense constructs for inhibiting expression of HDx proteins, and dominant negative
mutants of HDx proteins which competitively inhibit protein-substrate and/or
protein-protein interactions upstream and downstream of the wild-type HDx protein.

In an exemplary embodiment the subject method is used to treat tumor cells by
antagonizing HDx activity and blocking cell cycle progression. In one embodiment, the
subject method includes the treatment of testicular cells, so as to modulate spermatogenesis.
In another embodiment, the subject method is used to modulate osteogenesis, comprising
the treatment of osteogenic cells with an HDx polypeptide. Likewise, where the treated
cell is a chondrogenic cell, the present method is used to modulate chondrogenesis. In
still another embodiment, HDx polypeptides can be used to modulate the differentiation
of progenitor cells, e.g., the method can be used to cause differentiation of
hematopoietic cells, neuronal cells, or other stem/progenitor cell populations, to maintain
that cell in a differentiated state, and/or to enhance the survival of a differentiated cell,
e.g., to prevent apoptosis or other forms of cell death.

In addition to such HDx therapeutic uses, anti-fungal agents developed with such
screening assays as described herein can be used, for example, as preservatives in
foodstuff, feed supplements for promoting weight gain in livestock, or in disinfectant
formulations for treatment of non-living matter, e.g., for decontaminating hospital
equipment and rooms. In similar fashion, assays provided herein will permit selection of
deacetylase inhibitors which discriminate between the human and insect deacetylase
enzymes. Accordingly, the present invention expressly contemplates the use and
formulations of the deacetylase inhibitors in insecticides, such as for use in management
of insects like the fruit fly. Moreover, certain of the inhibitors can be selected on the
basis of inhibitory specificity for plant HDx-related activities relative to the mammalian
enzymes. Thus, the present invention specifically contemplates formulations of
deacetylase inhibitors for agricultural applications, such as in the form of a defoliant or
the like.

The present method is applicable, for example, to cell culture techniques, such as
in the culturing of hematopoietic cells and other cells whose survival or differentiative
state is dependent on HDx function. Moreover, HDx agonists and antagonists can be
used for therapeutic intervention, such as to enhance survival and maintenance of cells, as
well as to influence organogenic pathways, such as tissue patterning and other
differentiation processes. In an exemplary embodiment, the method is practiced for
modulating, in an animal, cell growth, cell differentiation or cell survival, and comprises
administering a therapeutically effective amount of an HDx polypeptide to alter, relative
to the absence of HDx treatment, at least one of (i) rate of growth, (ii) differentiation, or
(iii) survival of one or more cell-types in the animal.

Another aspect of the present invention provides a method of determining if a
subject, e.g. a human patient, is at risk for a disorder characterized by unwanted cell
proliferation or aberrant control of differentiation. The method includes detecting, in a
tissue of the subject, the presence or absence of a genetic lesion characterized by at least
one of (i) a mutation of a gene encoding an HDx protein, e.g. represented in one of SEQ
ID Nos: 1-4, or a homolog thereof; or (ii) the mis-expression of an HDx gene. In
preferred embodiments, detecting the genetic lesion includes ascertaining the existence of
at least one of a deletion of one or more nucleotides from an HDx gene; an addition of
one or more nucleotides to the gene; a substitution of one or more nucleotides of the
gene; a gross chromosomal rearrangement of the gene; an alteration in the level of a
messenger RNA transcript of the gene; the presence of a non-wild type splicing pattern of
a messenger RNA transcript of the gene; or a non-wild type level of the protein.

For example, detecting the genetic lesion can include (i) providing a probe/primer
including an oligonucleotide containing a region of nucleotide sequence which hybridizes
to a sense or antisense sequence of an HDx gene, e.g. a nucleic acid represented in one of
SEQ ID Nos: 1-4, or naturally occurring mutants thereof, or 5' or 3' flanking sequences
naturally associated with the HDx gene; (ii) exposing the probe/primer to nucleic acid of
the tissue; and (iii) detecting, by hybridization of the probe/primer to the nucleic acid, the
presence or absence of the genetic lesion; e.g. wherein detecting the lesion comprises
utilizing the probe/primer to determine the nucleotide sequence of the HDx gene and,
optionally, of the flanking nucleic acid sequences. For instance, the probe/primer can be
employed in a polymerase chain reaction (PCR) or in a ligation chain reaction (LCR). In
alternate embodiments, the level of an HDx protein is detected in an immunoassay using
an antibody which is specifically immunoreactive with the HDx protein.

In another aspect, the invention provides compounds useful for inhibition of
HDxs. In a preferred embodiment, an HDx inhibitor compound of the invention can be
represented by the formula A-B-C, in which A is a specificity element for selective
binding to an HDx, B is a linker element, and C is an electrophilic moiety capable of
reacting with a nucleophilic moiety of an HDx, with the proviso that the compound is not
butyrate, trapoxin, or trichostatin.

For instance, in one embodiment, there is provided a composition for inhibiting a
histone deacetylase comprising a compound represented by the general formula A-B-C,
wherein

A is selected from the group consisting of cycloalkyls, unsubstituted and
substituted aryls, heterocyclyls, amino acyls, and cyclopeptides;

B is selected from the group consisting of substituted and unsubstituted C4-C8
alkylidenes, C4-C8 alkenylidenes, C4-C8 alkynylidenes, and -(D-E-F)-, in which D and F
are, independently, absent or represent a C2-C7 alkylidene, a C2-C7 alkenylidene or a
C2-C7 alkynylidene, and E represents O, S, or NR', in which R' represents H, a lower alkyl, a
lower alkenyl, a lower alkynyl, an aralkyl, an aryl, or a heterocyclyl; and
C is selected from the group consisting of

[structural formulas not reproduced in this text] and a boronic acid; in
which Z represents O, S, or NR5, and Y; R5 represents a hydrogen, an alkyl, an
alkoxycarbonyl, an aryloxycarbonyl, an alkylsulfonyl, an arylsulfonyl or an aryl; R6
represents hydrogen, an alkyl, an alkenyl, an alkynyl or an aryl; and R7 represents
hydrogen, an alkyl, an aryl, an alkoxy, an aryloxy, an amino, a hydroxylamino, an
alkoxylamino or a halogen; with the proviso that the compound is not trapoxin.

In another preferred embodiment, the compound is represented by the general
formula A-B-C, wherein

A is selected from the group consisting of cycloalkyls, unsubstituted and
substituted aryls, heterocyclyls, amino acyls, and cyclopeptides;

B is selected from the group consisting of substituted and unsubstituted C4-C8
alkylidenes, C4-C8 alkenylidenes, C4-C8 alkynylidenes, and -(D-E-F)-, in which D and F
are, independently, absent or represent C2-C7 alkylidenes, C2-C7 alkenylidenes or C2-C7
alkynylidenes, and E represents O, S, or NR', in which R' represents H, a lower alkyl, a
lower alkenyl, a lower alkynyl, an aralkyl, an aryl, or a heterocyclyl; and

C is selected from the group consisting of

[structural formulas not reproduced in this text; the surviving labels are Y, OH, NH2, H and O]

in which R9 represents a hydrogen, an alkyl,
an aryl, a hydroxyl, an alkoxy, an aryloxy or an amino,

with the proviso that the inhibitor compound is not trichostatin.

In still another preferred embodiment, the compound is represented by the general
formula A-B-C, wherein

A is selected from the group consisting of cycloalkyls, unsubstituted and
substituted aryls, heterocyclyls, amino acyls, and cyclopeptides;

B is selected from the group consisting of substituted and unsubstituted C4-C8
alkylidenes, C4-C8 alkenylidenes, C4-C8 alkynylidenes, and -(D-E-F)-, in which D and F
are, independently, absent or a C2-C7 alkylidene, a C2-C7 alkenylidene, or a C2-C7
alkynylidene, and E represents O, S, or NR', in which R' is H, lower alkyl, lower alkenyl,
lower alkynyl, aralkyl, aryl, or heterocyclyl; and

C represents [structural formula not reproduced in this text]; in which Y is O or S, and R7 represents a hydrogen, an
alkyl, an aryl, an alkoxy, an aryloxy, an amino, a hydroxylamino, an alkoxylamino or a
halogen.

The present invention also contemplates pharmaceutical preparations of such 
compounds, e.g., in an amount effective for inhibiting proliferation of a cell, formulated
in a pharmaceutically acceptable diluent. 

Moreover, such compounds can be used for modulating one or more of growth, 
differentiation, or survival of a mammalian cell responsive to HDx-mediated histone
deacetylation, by treating the cell with an effective amount of the deacetylase inhibitor so 
as to modulate the deacetylase activity and alter, relative to the cell in the absence of the 
agent, at least one of (i) the rate of growth, (ii) the differentiation state, or (iii) the rate of 
survival of the cell. 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of cell biology, cell culture, molecular biology, transgenic 
biology, microbiology, recombinant DNA, and immunology, which are within the skill of 
the art. Such techniques are explained fully in the literature. See, for example, Molecular
Cloning: A Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold
Spring Harbor Laboratory Press: 1989); DNA Cloning, Volumes I and II (D. N. Glover
ed., 1985); Oligonucleotide Synthesis (M. J. Gait ed., 1984); Mullis et al. U.S. Patent
No: 4,683,195; Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984);
Transcription And Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of
Animal Cells (R. I. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes
(IRL Press, 1986); B. Perbal, A Practical Guide To Molecular Cloning (1984); the
treatise, Methods In Enzymology (Academic Press, Inc., N.Y.); Gene Transfer Vectors
For Mammalian Cells (J. H. Miller and M. P. Calos eds., 1987, Cold Spring Harbor
Laboratory); Methods In Enzymology, Vols. 154 and 155 (Wu et al. eds.);
Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, eds.,
Academic Press, London, 1987); Handbook Of Experimental Immunology, Volumes I-IV
(D. M. Weir and C. C. Blackwell, eds., 1986); Manipulating the Mouse Embryo
(Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986).

Other features and advantages of the invention will be apparent from the
following detailed description, and from the claims.

Brief Description of the Drawings 
Figure 1A illustrates the chemical structures of trapoxin and trichostatin, natural
products that inhibit the enzymatic deacetylation of lysine residues near the NH2-terminus
of histones. The epoxyketone side chain of trapoxin is approximately isosteric with
N-acetyl lysine and likely alkylates an active site nucleophile.

Figure 1B illustrates the copurification of trapoxin binding and histone
deacetylase activities. Nuclear proteins from bovine thymus were precipitated with
ammonium sulfate and fractionated on a Mono Q column. Trapoxin binding was
assayed by charcoal precipitation with [3H]trapoxin. For the histone deacetylase assay,
a peptide corresponding to bovine histone H4 (1-24) was synthesized. The peptide was
chemically acetylated with sodium [3H]acetate (5.3 Ci/mmol, New England
Nuclear)/BOP reagent (Aldrich) and purified by reverse phase HPLC. Two microliters of
[3H]peptide (~40,000 dpm) were used per 200 µl assay. After incubation at 37°C for one
hour, the reaction was quenched with 1 M HCl/0.16 M acetic acid (50 µl). Released
[3H]acetic acid was extracted with 600 µl of ethyl acetate and quantified by scintillation
counting. Pretreatment of crude or partially purified enzyme with trapoxin or trichostatin
(20 nM) for 30 min at 4°C abolished deacetylase activity. A280 = absorbance at 280 nm.

Figure 2A shows the synthesis of K-trap and the K-trap affinity matrix. K-trap 
contains a protected lysine residue in place of the phenylalanine at position two in 
trapoxin. Alloc = allyloxycarbonyl.

Figure 2B is a silver stained gel showing bovine and human trapoxin binding 
proteins. Proteins bound to the K-trap affinity matrix in the presence or absence of 
trapoxin or trichostatin were eluted by boiling in SDS loading buffer and analyzed by 
SDS-PAGE (9% gel). Nuclear proteins from human Jurkat T cells were prepared 
identically to those from bovine thymus (Figure 1B). Molecular size standards (in
kilodaltons) are indicated to the right. 

Figure 3A is the predicted amino acid sequence of human HD1. An in-frame stop
codon was found upstream of the starting methionine. Regions equivalent to
microsequenced tryptic peptides from the purified bovine protein are boxed. Underlined
amino acids 319-334 and 467-482 denote the sequences of synthetic peptides that were
conjugated to KLH and used to generate polyclonal antisera. Abbreviations for the
amino acid residues are: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ile; K,
Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and
Y, Tyr.
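
For readers working with the sequences electronically, the abbreviations listed above can be held in a simple lookup table. The following is a minimal sketch; the names `ONE_TO_THREE` and `to_three_letter` are hypothetical and the example peptide is arbitrary.

```python
# One-letter to three-letter codes for the twenty standard amino acids.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def to_three_letter(peptide: str) -> str:
    """Spell out a one-letter peptide string using three-letter codes."""
    return "-".join(ONE_TO_THREE[residue] for residue in peptide.upper())


example = to_three_letter("GGYT")  # arbitrary four-residue example
```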

Figure 3B is a protein immunoblot analogous to the silver stained gel in Figure
2B, showing the relationship between bovine p46-p49 and human p55 (top panels) and
confirming the identity of p50 (bovine and human) as RbAp48 (bottom panels). Proteins
eluted from the K-trap affinity matrix (Figure 2) were separated by SDS-PAGE and
transferred to Immobilon-P (Millipore). Blots were probed with polyclonal anti-HD1
(319-336) or monoclonal anti-RbAp48 and bound antibodies were detected with
enhanced chemiluminescence (Amersham).

Figure 4A is an immunoprecipitation of endogenous histone deacetylase activity
with affinity purified anti-HD1(467-482) antibodies. Anti-HD1(467-482)
immunoprecipitates from equivalent amounts of Jurkat nuclear extract (1 mg nuclear
protein supplemented with 0.5 M NaCl, 1% BSA, and 0.1% NP-40) were isolated in the
presence or absence of HD1(467-482) peptide competitor. After resuspending the
immunoprecipitates in HDx buffer [20 mM tris (pH 8), 150 mM NaCl, 10% glycerol],
inhibitors were added as indicated, and histone deacetylase activity was measured as
described in Figure 1B.

Figure 4B shows the coprecipitation of HD1 and RbAp48, as detected by protein
immunoblot analysis.

Figure 4C demonstrates the histone deacetylase activity of recombinant HD1-F.
TAg Jurkat cells (Clipstone et al. (1992) Nature 357, 695-7) were transfected with pBJ5
(vector alone) or pBJ5/HD1-F (encoding COOH-terminal FLAG epitope tagged HD1)
by electroporation and detergent lysates were prepared [0.5% Triton X-100, 50 mM tris
(pH 8), 100 mM NaCl, 10% glycerol]. Anti-FLAG antibodies conjugated to agarose
beads (IBI) were used to immunoprecipitate recombinant HD1 in the presence or absence
of FLAG peptide competitor, and histone deacetylase activity was measured as described
above.

Figure 4D shows the interaction between recombinant HD1-F and the K-trap
affinity matrix. Lysates from Jurkat cells transfected with pBJ5/HD1-F were incubated
with the K-trap affinity matrix in the presence or absence of inhibitors. Immunoblots of 
the eluted proteins were probed with the anti-FLAG M2 monoclonal antibody (IBI). 

Figures 5A and 5B are sequence alignments for various HDx and HDx-related
cDNAs and proteins, respectively.

Figure 6 depicts exemplary specificity elements (A), linker elements (B), and
electrophilic moieties (C) for generating compounds which are capable of reacting with a
nucleophilic moiety of an HDx protein.

Figure 7 illustrates an exemplary synthesis of trichostatin analogs. 

Figures 8A-8C illustrate a synthesis of tritiated Trapoxin B. 

Figures 9A-9C depict a synthesis of the K-trap and K-trap affinity matrix.

Figures 10A-10B: mSin3A is present in cells as a large stable multiprotein
complex. Nuclear lysates were prepared from U937 cells metabolically labeled with
[35S]-methionine and low stringency immunoprecipitations performed with antiserum
specific for mSin3A. "+ block" shows proteins immunoprecipitated when the
anti-mSin3A was preincubated with purified GST-PAH2 (A). In (B), low stringency mSin3A
immunoprecipitates were washed for an additional 60 minutes using the salt and 
detergent conditions indicated at the top of the Figure. In (A) and (B), the 
immunoprecipitates were analyzed by SDS-PAGE and autoradiography. Apparent 
molecular weight of the coprecipitating proteins and the sizes of the molecular weight 
markers are given in kilodaltons. 

Figures 11A-D: mSin3A and HDAC1 associate in vivo. Immunoprecipitations
were performed using nuclear extracts from [35S]-methionine labeled U937 cells. (A) The
left lane shows proteins from an anti-mSin3A immunoprecipitate. The right lane shows
proteins eluted from an anti-mSin3A immunoprecipitate and reprecipitated with
antiserum specific for HD1. In (B) and (C), low stringency immunoprecipitations were
performed using antiserum specific for the carboxy-terminus of HD1. "+ block" indicates
that the HD1 antiserum was preincubated with the immunizing peptide. In (C), proteins
immunoprecipitated with anti-mSin3A are shown for reference; proteins eluted from a low
stringency anti-HD1 immunoprecipitate and reprecipitated with anti-mSin3A are shown
in the rightmost lane. In (A), (B), and (C), autoradiographs of SDS-PAGE gels are
shown. Apparent molecular weights of the coprecipitating proteins and the sizes of the
molecular weight markers are given in kilodaltons. In (D), in vitro histone deacetylase
activity in anti-mSin3A immunoprecipitates is shown. Human Jurkat cell extracts (12
mg) were immunoprecipitated using anti-mSin3A polyclonal antibodies. "+ block"
indicates that the anti-mSin3A antibody was preincubated with GST-PAH2; "+10 nM
trapoxin" indicates that the immunoprecipitated proteins were pretreated with 10 nM
trapoxin for 30 minutes at 4°C prior to being assayed for histone deacetylase activity.

Figures 12A-C: RbAp48 is associated with mSin3A in vivo and recombinant
HD1, RbAp48 and mSin3A copurify from insect cell extracts. (A) TAg Jurkat cell
lysates were immunoprecipitated using antibodies specific to the C-terminal portion of HD1
(left) or antibodies specific to PAH2 of mSin3A (right). Parallel immunoprecipitations
were blocked as described in Figure 11. Immunopurified proteins were analyzed by
SDS-PAGE and immunoblotted with α-RbAp48 monoclonal antibody 12B1. (B & C) Equal
amounts of baculovirus coinfected Sf9 cell extracts were affinity purified using
Ni2+-agarose "Ni" or α-FLAG-M2-agarose "F". Purified recombinant proteins were
analyzed by SDS-PAGE, transferred to Immobilon-P (Millipore) and immunoblotted with
α-FLAG to detect HD1-F (B) or α-Flu (12CA5) to detect p48-HA (C). We observe a
reduction in expression of HD1-F and p48-HA when coexpressed with mSin3A.

Figures 13A-C: Trapoxin reverses transcriptional repression by mSin3A. (A)
The structure of the minimal reporter gene derived from the myelomonocytic growth
factor gene and the expression vectors. Mad(Pro)N35GALVP16 has leucine at position
12 and alanine at position 16 mutated to proline as indicated. These point mutations
prevent association between mSin3A and Mad (Ayer et al., 1995). The transcriptional
activity of MadN35GALVP16 and Mad(Pro)N35GALVP16 was determined by
measuring luciferase activity (Relative Light Units, RLU) of transfected 293 cells
following an 8 hour treatment with 0 (solid bars) or 10 nM trapoxin (striped bars) (B).
To control for differences in transfection efficiency, the RLU values were normalized to
the β-galactosidase activity produced by a cotransfected CMV-βGAL construct. Shown
are data from a representative experiment, and the error is reported as the standard error of
the mean (s.e.m.). This experiment has been done a minimum of five times in triplicate
with similar results. An 8 hour treatment of 293 cells with 10 nM trapoxin is within the
linear range of the response of the reporter gene. Furthermore, trapoxin treatment did
not prevent association between mSin3A and HD1 (data not shown). (C) Trapoxin
inhibits histone deacetylase activity of human 293 cells in vivo. 2 x 10^ cells were
cultured for 8 hours in the absence, "0", or in the presence of 10 nM trapoxin. Cells
were harvested and crude extracts from approximately 1 x 10^^ cells (solid bars) or
anti-HD1 immunoprecipitations of extracts from approximately 4 x 10^^ cells (white bars)
were assayed for histone deacetylase activity in vitro.
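
The normalization and error reporting described for the reporter assay can be sketched as follows, assuming each replicate well yields one luciferase reading and one β-galactosidase reading. The function names and the triplicate values are hypothetical and are not data from this document.

```python
from math import sqrt
from statistics import mean, stdev


def normalized_rlu(luciferase_rlu, beta_gal_activity):
    """Divide each luciferase reading by the beta-galactosidase reading of the same well."""
    return [rlu / gal for rlu, gal in zip(luciferase_rlu, beta_gal_activity)]


def mean_and_sem(values):
    """Return the mean and the standard error of the mean (sample SD / sqrt(n))."""
    return mean(values), stdev(values) / sqrt(len(values))


# Hypothetical triplicate readings for one transfection condition.
rlu = [1200.0, 1650.0, 840.0]
beta_gal = [2.0, 2.5, 1.5]
normalized = normalized_rlu(rlu, beta_gal)
average, sem = mean_and_sem(normalized)
```

Comparing `average` between untreated and trapoxin-treated wells, with `sem` as the error bar, mirrors the presentation described for panel (B).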

Detailed Description of the Invention

The positioning of nucleosomes relative to particular regulatory elements in 
genomic DNA has emerged as a mechanism for managing the association of sequence- 
specific DNA-binding proteins with promoters, enhancers and other transcriptional 
regulatory sequences. Two modifications to nucleosomes have been observed to 
influence the association of DNA-binding proteins with chromatin. Depletion of histones 
H2A/H2B from the nucleosome facilitates the binding of RNA polymerase II (Baer et al.
(1983) Nature 301:482-488) and TFIIIA (Hayes et al. (1992) PNAS 89:1229-1233).
Likewise, acetylation of the core histones apparently destabilizes the nucleosome and is
thought to modulate the accessibility of transcription factors to their respective enhancer
and promoter elements (Oliva et al. (1990) Nuc Acid Res 18:2739-2747; and Walker et
al. (1990) J Biol Chem 265:5622-5746). In both cases, overall histone-DNA contacts 
are altered. 

In one aspect, the present invention concerns the discovery of a family of genes in
mammals, the gene products of which are referred to herein as "histone deacetylases" or
"HDxs". Experimental evidence indicates a functional role for the HDx gene products as
catalysts of the deacetylation of histones in mammalian cells, and accordingly a role
in determining tissue fate and maintenance. For instance, the results provided below
indicate that proteins encoded by the HDx genes may participate, under various
circumstances, in the control of proliferation, differentiation and cell death.

The family of HDx genes apparently encodes at least three different sub-families,
e.g., paralogs, and members have been identified from the cells of various mammals. The HDx
proteins were first isolated from bovine thymus nuclei by use of a binding assay which
exploited the ability of trapoxin, an inhibitor of histone deacetylase activity, to isolate
proteins which co-purified with a histone deacetylase activity. The partial identity of the
isolated proteins was determined by peptide microsequencing, and primers based on the
peptide sequences were used to clone human cDNAs from a T cell library. One of the
HDx gene products described below is referred to herein as HD1, and is represented in
SEQ ID No. 1 (nucleotide) and SEQ ID No. 5 (amino acid).

A search of expressed sequence tag (EST) libraries turned up partial sequences
for human HDx transcripts, and revealed the existence of at least two other human HDx
genes related to HD1, these other paralogs referred to herein as HD2 and HD3.
Nucleotide and amino acid sequences for partial clones of other human HDx homologs
are provided by SEQ ID Nos. 2-4 and 6-8, respectively.

Analysis of the HDx sequences indicated no obvious similarities with any
previously identified domains or motifs. However, the fact that each full-length clone
lacks a signal sequence, along with the observation that proteins can be detected in the
nucleus, indicates that the HDx genes encode intracellular proteins.

Careful inspection of the HDx clones suggests at least two novel motifs, one or
both of which may be characteristic of at least subfamilies of the mammalian HDx family.
The first apparently conserved structural element of the HDx family occurs in the
N-terminal portion of the molecule, and is designated herein as the "v motif". With
reference to human HD1, the v motif corresponds to amino acid residues
Asp130-Phe198. By alignment of the human HDx sequences, the element is represented by the
consensus sequence:

dxxx>dcx:gglhhakkxeasgfcyxndivxxe<^ 

VMTXSF, (SEQ ID No. 12) 

more preferably by the consensus sequence: 

DIAXimVAGGLHHAKKX2EASGFCYVNDIVX3X4lLELLKYHX5RVL^
7TD-RVMTVSF (SEQ ID No. 13) 

wherein each X represents any single amino acid, though more preferably represents
an amino acid residue in the corresponding human HDx sequences of the appended
sequence listing.

A second motif, herein designated the χ motif, is represented by the consensus
sequence:

C\OCXXKXFXXPXXXXGGGGYTXRNVARXWX>^ (SEQ ID No. 14)
more preferably by the consensus sequence:

CVEX8VKX9FNX10PLLX11LGGGGYTX12RNVARCWTYET (SEQ ID No: 15)

wherein each X represents any single amino acid, though more preferably represents
an amino acid residue in the corresponding human HDx sequences of the appended
sequence listing. The χ motif can be found in the human HD1 sequence at C284-Thr316.
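
A consensus of this kind can be searched for in a protein sequence with a regular expression. The sketch below assumes that each subscripted X in the more preferred consensus for the second motif stands for exactly one residue, which gives a 33-residue pattern consistent with the span C284 to Thr316. The surrounding toy protein is invented for illustration and the function names are hypothetical.

```python
import re

# More preferred consensus for the second motif, with subscripted X positions
# collapsed to a plain wildcard X (an assumption made for this sketch).
CONSENSUS = "CVEXVKXFNXPLLXLGGGGYTXRNVARCWTYET"


def consensus_to_regex(consensus: str):
    """Compile a consensus string, treating X as any single amino acid."""
    return re.compile("".join("[A-Z]" if c == "X" else c for c in consensus))


def find_motif(protein: str, consensus: str = CONSENSUS):
    """Return (start, end) residue positions, 1-based and inclusive, of each match."""
    pattern = consensus_to_regex(consensus)
    return [(m.start() + 1, m.end()) for m in pattern.finditer(protein.upper())]


# Invented toy protein: five flanking residues on each side of one motif copy.
toy_protein = "MAQTQ" + "CVEFVKSFNLPLLMLGGGGYTIRNVARCWTYET" + "AVALD"
matches = find_motif(toy_protein)
```

Applied to a full-length sequence from the sequence listing, the same call would report where, if anywhere, the motif occurs.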

The family of HDx proteins apparently ranges in size from about 40kD to about
60kD for the unmodified polypeptide chain. For instance, the bovine HD1 protein
migrates on an SDS-PAGE (9%) gel with an apparent molecular weight of 46kD. The
human HD1 amino acid sequence predicts a molecular weight for the polypeptide chain
of 55kD.
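
A molecular weight predicted from an amino acid sequence is conventionally the sum of the average residue masses plus one water molecule for the free termini. The sketch below shows that arithmetic under this convention; it ignores post-translational modification, which is one reason an apparent SDS-PAGE mobility can differ from the predicted value. The function name and the short test peptides are illustrative only.

```python
# Average residue masses in daltons (amino acid minus one water).
AVERAGE_RESIDUE_MASS = {
    "G": 57.05, "A": 71.08, "S": 87.08, "P": 97.12, "V": 99.13,
    "T": 101.10, "C": 103.14, "L": 113.16, "I": 113.16, "N": 114.10,
    "D": 115.09, "Q": 128.13, "K": 128.17, "E": 129.12, "M": 131.19,
    "H": 137.14, "F": 147.18, "R": 156.19, "Y": 163.18, "W": 186.21,
}
WATER = 18.015  # added once for the N-terminal H and C-terminal OH


def polypeptide_mass_da(sequence: str) -> float:
    """Estimate the mass in daltons of an unmodified polypeptide chain."""
    return sum(AVERAGE_RESIDUE_MASS[r] for r in sequence.upper()) + WATER


def polypeptide_mass_kd(sequence: str) -> float:
    """Same estimate expressed in kilodaltons."""
    return polypeptide_mass_da(sequence) / 1000.0


glycine_mass = polypeptide_mass_da("G")
diglycine_mass = polypeptide_mass_da("GG")
```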

Accordingly, certain aspects of the present invention relate to nucleic acids
encoding HDx proteins, the HDx proteins themselves, antibodies immunoreactive with
HDx proteins, and preparations of such compositions. Moreover, the present invention
provides diagnostic and therapeutic assays and reagents for detecting and treating
disorders involving, for example, aberrant expression (or loss thereof) of HDx homologs.
In addition, drug discovery assays are provided for identifying agents which can modulate
the biological function of HDx proteins, such as by altering the binding of HDx molecules
to either proteins or nucleic acids. Such agents can be useful therapeutically to alter the 
growth and/or differentiation of a cell. Other aspects of the invention are described 
below or will be apparent to those skilled in the art in light of the present disclosure. 

Analysis of the human HDx sequences, while not revealing any obvious 
similarities to known domains or motifs, did indicate similarities with previously identified 
proteins from both Saccharomyces cerevisiae and Xenopus laevis. Those genes, RPD3 
(SEQ ID No. 9) and Xe-RPD3 (SEQ ID No. 10), respectively, had not previously been 
ascribed any specific function. However, based on our observations for the function of
HD1, it is now apparent that each of these other proteins is also a deacetylase and
represents a potential therapeutic target. Accordingly, drug discovery assays are provided
for identifying agents which can modulate the biological function of "HDx-related"
proteins, such as RPD3 homologs, by altering the enzymatic activity of the deacetylase,
or its binding to other cellular components including homologs of RbAp48 (described 
infra). Such agents can be useful therapeutically to alter the growth and/or differentiation 
of non-human cells, such as in the treatment of mycotic infections, or as additives to 
livestock feed, e.g., to promote weight gain, or as topical antiseptics for sterilizing 
medical equipment. 

In addition, we isolated another bovine protein having an approximate molecular
size of 50kD which apparently binds HDx proteins isolated by the trapoxin matrix, and
microsequencing of that protein demonstrated that it was related to the protein referred
to in the art as RbAp48 (Qian et al. (1993) Nature 364:648; SEQ ID No. 11). RbAp48
was originally identified as a protein that binds to the retinoblastoma (Rb) gene product.
The retinoblastoma (RB) gene product plays a role in tumor suppression (Weinberg,
R.A., (Sept 1988) Scientific Amer. pp 44-51; Hansen et al. (1988) Trends Genet 4:125-128).
The role of RB as a tumor-suppressor protein in cell-cycle control is believed to be
similar to that of another tumor-suppressor, p53 (Green (1989) Cell 56:1-3; Mowat et al.
(1985) Nature 314:633-636). Inactivation or mutation of the second RB allele in one of
the somatic cells of susceptible individuals appears to be the molecular event that
leads to tumor formation (Cavenee et al. (1983) Nature 305:779-784; Friend et al.
(1987) PNAS 84:9059-9063).

The growth suppression function of the retinoblastoma protein is thought to be
mediated by Rb binding to cellular proteins. RbAp48 is one of the major proteins that
bind to a putative functional domain at the carboxy terminus of the Rb protein.
Complex formation between RbAp48 and Rb occurs in vitro and in vivo, and apparently
involves direct interaction between the proteins. Like Rb, RbAp48 is a ubiquitously
expressed nuclear protein. RbAp48 shares sequence homology with MSI1, a negative
regulator of the Ras-cyclic AMP pathway in the yeast Saccharomyces cerevisiae.
Furthermore, like MSI1, human RbAp48 suppresses the heat-shock sensitivity of the
yeast ira1 strains and RAS2Val19 strains. Interaction with RbAp48 may be one of the
mechanisms for suppression of growth mediated by Rb. Accordingly, the interaction of
RbAp48 with HDx proteins further implicates the HDx proteins in cell-cycle regulation.

The RbAp48 interaction with HDx and HDx-related proteins represents yet
another therapeutic target. Accordingly, drug discovery assays are provided for
identifying agents which can modulate the interaction of RbAp48 proteins and the like
with HDx-related proteins. Such assays can be derived to detect the ability of a test
agent to alter protein-protein contacts, or to alter the enzymatic activity of the
deacetylase in complexes including an RbAp48 protein (e.g., where such complexes
allosterically modulate the HDx enzymatic activity). As above, such agents can be useful
therapeutically to alter the growth and/or differentiation of cells.

Members of the Mad family of BHLHZip proteins heterodimerize with Max to
repress transcription in a sequence-specific manner. Transcriptional repression by
Mad:Max heterodimers is mediated by ternary complex formation with either of the
corepressors mSin3A or mSin3B. Example 3 demonstrates that Sin3 proteins are in
vivo components of large, heterogeneous multiprotein complexes and are tightly and
specifically associated with at least seven polypeptides. Two of the Sin3-associated
proteins, p50 and p55, are members of the histone deacetylase family described herein.
Sin3 immune complexes possess histone deacetylase activity that is sensitive to the
specific deacetylase inhibitor trapoxin. Sin3 targeted repression of a reporter gene is
reduced by trapoxin treatment, suggesting that histone deacetylation mediates
transcriptional repression through Mad-Max-Sin3A multimeric complexes.

The Sin3 interaction with HDx and HDx-related proteins represents still another
therapeutic target. Thus, in one aspect of the present invention there are provided drug
discovery assays for identifying agents which can modulate the interaction of Sin3
proteins and the like with HDx-related proteins. Such assays can be derived to detect the
ability of a test agent to alter protein-protein contacts, or to alter the enzymatic activity
of the deacetylase in complexes including Sin3 or other transcriptional regulatory
proteins. As above, such agents can be useful therapeutically to alter the growth and/or
differentiation of cells.

For convenience, certain terms employed in the specification, examples, and 
appended claims are collected here. 

As used herein, the term "nucleic acid" refers to polynucleotides such as
deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The
term should also be understood to include, as equivalents, analogs of either RNA or
DNA made from nucleotide analogs, and, as applicable to the embodiment being
described, single (sense or antisense) and double-stranded polynucleotides.

As used herein, the term "gene" or "recombinant gene" refers to a nucleic acid
comprising an open reading frame encoding one of the HDx polypeptides of the present
invention, including both exon and (optionally) intron sequences. A "recombinant gene"
refers to nucleic acid encoding an HDx polypeptide and comprising HDx-encoding exon
sequences, though it may optionally include intron sequences which are either derived
from a chromosomal HDx gene or from an unrelated chromosomal gene. Exemplary
recombinant genes encoding the subject HDx polypeptide are represented in the
appended Sequence Listing. The term "intron" refers to a DNA sequence present in a
given HDx gene which is not translated into protein and is generally found between
exons.

As used herein, the term "transfection" means the introduction of a nucleic acid,
e.g., an expression vector, into a recipient cell by nucleic acid-mediated gene transfer.
"Transformation", as used herein, refers to a process in which a cell's genotype is
changed as a result of the cellular uptake of exogenous DNA or RNA, and, for example,
the transformed cell expresses a recombinant form of an HDx polypeptide or, where
antisense expression occurs from the transferred gene, the expression of a
naturally-occurring form of the HDx protein is disrupted.

As used herein, the term "specifically hybridizes" refers to the ability of the
probe/primer of the invention to hybridize to at least 15 consecutive nucleotides of an
HDx gene, such as an HDx sequence designated in one of SEQ ID Nos: 1-4, or a
sequence complementary thereto, or naturally occurring mutants thereof, such that it has
less than 15%, preferably less than 10%, and more preferably less than 5% background
hybridization to a cellular nucleic acid (e.g., mRNA or genomic DNA) encoding a protein
other than an HDx protein, as defined herein. In preferred embodiments, the
oligonucleotide probe specifically detects only one of the subject HDx paralogs, e.g.,
does not substantially hybridize to transcripts for other HDx homologs in the same
species.

As used herein, the term "vector" refers to a nucleic acid molecule capable of
transporting another nucleic acid to which it has been linked. One type of preferred
vector is an episome, i.e., a nucleic acid capable of extra-chromosomal replication.
Preferred vectors are those capable of autonomous replication and/or expression of nucleic
acids to which they are linked. Vectors capable of directing the expression of genes to
which they are operatively linked are referred to herein as "expression vectors". In
general, expression vectors of utility in recombinant DNA techniques are often in the
form of "plasmids" which refer generally to circular double stranded DNA loops which,
in their vector form, are not bound to the chromosome. In the present specification,
"plasmid" and "vector" are used interchangeably as the plasmid is the most commonly
used form of vector. However, the invention is intended to include such other forms of
expression vectors which serve equivalent functions and which become known in the art
subsequently hereto.

"Transcriptional regulatory sequence" is a generic term used throughout the 
specification to refer to DNA sequences, such as initiation signals, enhancers and
promoters, which induce or control transcription of protein coding sequences with which
they are operably linked. In preferred embodiments, transcription of one of the
recombinant HDx genes is under the control of a promoter sequence (or other
transcriptional regulatory sequence) which controls the expression of the recombinant
gene in a cell-type in which expression is intended. It will also be understood that the
recombinant gene can be under the control of transcriptional regulatory sequences which
are the same or which are different from those sequences which control transcription of
the naturally-occurring forms of HDx genes.

As used herein, the term "tissue-specific promoter" means a DNA sequence that
serves as a promoter, i.e., regulates expression of a selected DNA sequence operably
linked to the promoter, and which effects expression of the selected DNA sequence in
specific cells of a tissue, such as cells of hepatic, pancreatic, neuronal or hematopoietic
origin. The term also covers so-called "leaky" promoters, which regulate expression of a
selected DNA primarily in one tissue, but can cause at least low level expression in other
tissues as well.

As used herein, a "transgenic animal" is any animal, preferably a non-human 
mammal, bird or an amphibian, in which one or more of the cells of the animal contain 
heterologous nucleic acid introduced by way of human intervention, such as by transgenic 
techniques well known in the art. The nucleic acid is introduced into the cell, directly or 
indirectly by introduction into a precursor of the cell, by way of deliberate genetic 
manipulation, such as by microinjection or by infection with a recombinant virus. The
term genetic manipulation does not include classical cross-breeding, or in vitro
fertilization, but rather is directed to the introduction of a recombinant DNA molecule.
This molecule may be integrated within a chromosome, or it may be extrachromosomally
replicating DNA. In the typical transgenic animals described herein, the transgene causes 
cells to express a recombinant form of one of the HDx proteins, e.g. either agonistic or 
antagonistic forms. However, transgenic animals in which the recombinant HDx gene is 
silent are also contemplated, as for example, the FLP or CRE recombinase dependent 
constructs described below. Moreover, "transgenic animal" also includes those 
recombinant animals in which gene disruption of one or more HDx genes is caused by 
human intervention, including both recombination and antisense techniques. 

The "non-human animals" of the invention include vertebrates such as rodents, 
non-human primates, sheep, dog, cow, chickens, amphibians, reptiles, etc. Preferred non- 
human animals are selected from the rodent family including rat and mouse, most 
preferably mouse, though transgenic amphibians, such as members of the Xenopus genus, 
and transgenic chickens can also provide important tools for understanding and 
identifying agents which can affect, for example, embryogenesis and tissue formation.
The invention also contemplates transgenic insects, including those of the genus 
Drosophila, such as D. melanogaster. The term "chimeric animal" is used herein to refer 
to animals in which the recombinant gene is found, or in which the recombinant is 
expressed in some but not all cells of the animal. The term "tissue-specific chimeric 
animal" indicates that one of the recombinant HDx genes is present and/or expressed or 
disrupted in some tissues but not others.

As used herein, the term "transgene" means a nucleic acid sequence (encoding,
e.g., one of the HDx polypeptides, or encoding an antisense transcript thereto), which is
partly or entirely heterologous, i.e., foreign, to the transgenic animal or cell into which it
is introduced, or, is homologous to an endogenous gene of the transgenic animal or cell
into which it is introduced, but which is designed to be inserted, or is inserted, into the
animal's genome in such a way as to alter the genome of the cell into which it is inserted
(e.g., it is inserted at a location which differs from that of the natural gene or its insertion
results in a knockout). A transgene can include one or more transcriptional regulatory
sequences and any other nucleic acid, such as introns, that may be necessary for optimal
expression of a selected nucleic acid.

As is well known, genes for a particular polypeptide may exist in single or 
multiple copies within the genome of an individual. Such duplicate genes may be identical 
or may have certain modifications, including nucleotide substitutions, additions or 
deletions, which all still code for polypeptides having substantially the same activity. The 
term "DNA sequence encoding an HDx polypeptide" may thus refer to one or more
genes within a particular individual. Moreover, certain differences in nucleotide 
sequences may exist between individuals of the same species, which are called alleles. 
Such allelic differences may or may not result in differences in amino acid sequence of the 
encoded polypeptide yet still encode a protein with the same biological activity. 

"Homology" refers to sequence similarity between two peptides or between two
nucleic acid molecules. Homology can be determined by comparing a position in each
sequence which may be aligned for purposes of comparison. When a position in the
compared sequence is occupied by the same base or amino acid, then the molecules are
homologous at that position. A degree of homology between sequences is a function of
the number of matching or homologous positions shared by the sequences. An
"unrelated" or "non-homologous" sequence shares less than 40 percent identity, though
preferably less than 25 percent identity, with one of the HDx sequences of the present
invention.
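
The position-by-position comparison described in this definition can be sketched as follows. This is a minimal illustration that assumes the two sequences have already been aligned to equal length with "-" marking gaps. Counting only gap-free columns in the denominator is an assumption, since the text does not fix a convention, and the function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of aligned, gap-free positions occupied by the same base or amino acid."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    # Keep only columns where neither sequence has a gap (assumed convention).
    columns = [(x, y) for x, y in zip(aligned_a.upper(), aligned_b.upper())
               if x != gap and y != gap]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def is_unrelated(aligned_a: str, aligned_b: str, cutoff: float = 40.0) -> bool:
    """Apply the 'less than 40 percent identity' threshold from the definition above."""
    return percent_identity(aligned_a, aligned_b) < cutoff


# Hypothetical aligned fragments, for illustration only.
print(percent_identity("ACDEFG", "ACDQFG"))
print(is_unrelated("AAAAA", "AGGGG"))
```

Passing `cutoff=25.0` applies the stricter, preferred threshold mentioned in the same sentence.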

As used herein, an "HDx-related" protein refers to the HDx proteins described
herein, and other human homologs of those HDx sequences, as well as orthologs and
paralogs (homologs) of the HDx proteins in other species, ranging from yeast to other
mammals, e.g., homologous histone deacetylases. The term "ortholog" refers to genes or
proteins which are homologs via speciation, e.g., closely related and assumed to have
common descent based on structural and functional considerations. Orthologous proteins
have recognizably the same activity in different species. The term "paralog" refers
to genes or proteins which are homologs via gene duplication, e.g., duplicated variants of
a gene within a genome. See also, Fitch, WM (1970) Syst Zool 19:99-113.

"Cells," "host cells" or "recombinant host cells" are terms used interchangeably 
herein. It is understood that such terms refer not only to the particular subject cell but to
the progeny or potential progeny of such a cell. Because certain modifications may occur
in succeeding generations due to either mutation or environmental influences, such 
progeny may not, in fact, be identical to the parent cell, but are still included within the 
scope of the term as used herein. 

A "chimeric protein" or "fusion protein" is a fusion of a first amino acid sequence 
encoding one of the subject HDx polypeptides with a second amino acid sequence 
defining a domain (e.g. polypeptide portion) foreign to and not substantially homologous 
with any domain of one of the HDx proteins. A chimeric protein may present a foreign 
domain which is found (albeit in a different protein) in an organism which also expresses 
the first protein, or it may be an "interspecies", "intergenic", etc. fusion of protein
structures expressed by different kinds of organisms. In general, a fusion protein can be
represented by the general formula X-HDx-Y, wherein HDx represents a portion of the
protein which is derived from one of the HDx proteins, and X and Y are, independently,
absent or represent amino acid sequences which are not related to one of the HDx
sequences in an organism.

The term "isolated" as also used herein with respect to nucleic acids, such as 
DNA or RNA, refers to molecules separated from other DNAs, or RNAs, respectively,
that are present in the natural source of the macromolecule. For example, an isolated
nucleic acid encoding one of the subject HDx polypeptides preferably includes no more
than 10 kilobases (kb) of nucleic acid sequence which naturally immediately flanks the
HDx gene in genomic DNA, more preferably no more than 5kb of such naturally
occurring flanking sequences, and most preferably less than 1.5kb of such naturally
occurring flanking sequence. The term isolated as used herein also refers to a nucleic
acid or peptide that is substantially free of cellular material, viral material, or culture
medium when produced by recombinant DNA techniques, or chemical precursors or
other chemicals when chemically synthesized. Moreover, an "isolated nucleic acid" is
meant to include nucleic acid fragments which are not naturally occurring as fragments
and would not be found in the natural state.

As described below, one aspect of the invention pertains to isolated nucleic acids
comprising nucleotide sequences encoding HDx polypeptides, and/or equivalents of such
nucleic acids. The term nucleic acid as used herein is intended to include fragments as
equivalents. The term equivalent is understood to include nucleotide sequences encoding
functionally equivalent HDx polypeptides or functionally equivalent peptides having an
activity of an HDx protein such as described herein. Equivalent nucleotide sequences will
include sequences that differ by one or more nucleotide substitutions, additions or
deletions, such as allelic variants; and will, therefore, include sequences that differ from
the nucleotide sequence of the HDx cDNA sequences shown in any of SEQ ID Nos: 1-4
due to the degeneracy of the genetic code. Equivalents will also include nucleotide
sequences that hybridize under stringent conditions (i.e., equivalent to about 20-27°C
below the melting temperature (Tm) of the DNA duplex formed in about 1M salt) to the
nucleotide sequences represented in one or more of SEQ ID Nos: 1-4. In one
embodiment, equivalents will further include nucleic acid sequences derived from, and
evolutionarily related to, a nucleotide sequence shown in any of SEQ ID Nos: 1-4.
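
To make the "20-27°C below Tm in about 1M salt" criterion concrete, the sketch below estimates a duplex melting temperature and the resulting temperature window. The formula used, Tm = 81.5 + 16.6 log10[Na+] + 0.41(%GC) - 675/N, is one widely used empirical approximation for long DNA duplexes. It is an assumption here and is not stated in the specification, and the probe sequence and function names are hypothetical.

```python
import math


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = seq.upper()
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def duplex_tm_celsius(seq: str, na_molar: float = 1.0) -> float:
    """Empirical Tm estimate for a long DNA duplex (assumed formula, see lead-in)."""
    n = len(seq)
    return 81.5 + 16.6 * math.log10(na_molar) + 0.41 * gc_percent(seq) - 675.0 / n


def stringent_window(seq: str, na_molar: float = 1.0):
    """Temperature range lying 27 to 20 degrees C below the estimated Tm."""
    tm = duplex_tm_celsius(seq, na_molar)
    return (tm - 27.0, tm - 20.0)


# Hypothetical 200-nucleotide probe with 50% GC content.
probe = "ATGC" * 50
print(duplex_tm_celsius(probe))
print(stringent_window(probe))
```

At 1M sodium the salt term vanishes, so the estimate depends only on GC content and duplex length.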

Moreover, it will be generally appreciated that, under certain circumstances, it
may be advantageous to provide homologs of one of the subject HDx polypeptides which
function in a limited capacity as one of either an HDx agonist (mimetic) or an HDx
antagonist, in order to promote or inhibit only a subset of the biological activities of the
naturally-occurring form of the protein. Thus, specific biological effects can be elicited
by treatment with a homolog of limited function, and with fewer side effects relative to
treatment with agonists or antagonists which are directed to all of the biological activities
of naturally occurring forms of HDx proteins.

Homologs of each of the subject HDx proteins can be generated by mutagenesis, 
such as by discrete point mutation(s), or by truncation. For instance, mutation can give 
rise to homologs which retain substantially the same, or merely a subset, of the biological
activity of the HDx polypeptide from which it was derived. Alternatively, antagonistic
forms of the protein can be generated which are able to inhibit the function of the 
naturally occurring form of the protein, such as by competitively binding to an HDx 
substrate or HDx associated protein, as for example competing with wild-type HDx in the 
binding of RbAp48 or a histone. In addition, agonistic forms of the protein may be
generated which are constitutively active, or have an altered Kcat or Km for deacetylation
reactions. Thus, the HDx protein and homologs thereof provided by the subject
invention may be either positive or negative regulators of transcription and/or replication.

In general, polypeptides referred to herein as having an activity of an HDx protein
(e.g., are "bioactive") are defined as polypeptides which include an amino acid sequence
corresponding (e.g., identical or homologous) to all or a portion of the amino acid
sequences of an HDx protein shown in any one or more of SEQ ID Nos: 5-8 and which
mimic or antagonize all or a portion of the biological/biochemical activities of a naturally
occurring HDx protein. Examples of such biological activity include the ability to
modulate proliferation of cells. For example, inhibiting histone deacetylation causes cells
to arrest in G1 and G2 phases of the cell cycle. The biochemical activity associated with
HDx proteins of the present invention can also be characterized in terms of binding to and
(optionally) catalyzing the deacetylation of an acetylated histone. Another biochemical
property of certain of the subject HDx proteins involves binding to other cellular
proteins, such as RbAp48 or Sin3A.

Other biological activities of the subject HDx proteins are described herein or will 
be reasonably apparent to those skilled in the art. According to the present invention, a 
polypeptide has biological activity if it is a specific agonist or antagonist of a naturally- 
occurring form of an HDx protein. 

Preferred nucleic acids encode an HDx polypeptide comprising an amino acid 
sequence at least 80% homologous, more preferably at least 85% homologous and most 
preferably at least 88% homologous with an amino acid sequence of a human HDx, e.g., 
such as selected from the group consisting of SEQ ID Nos: 5-8. Nucleic acids which
encode polypeptides having at least about 90%, more preferably at least about 95%, and most
preferably at least about 98-99% homology with an amino acid sequence represented in
one of SEQ ID Nos:5-8 are of course also within the scope of the invention, as are 
nucleic acids identical in sequence with any of the enumerated HDx sequences of the 
sequence listing. In one embodiment, the nucleic acid is a cDNA encoding a polypeptide 
having at least one activity of the subject HDx polypeptide. 

In certain preferred embodiments, the invention features a purified or recombinant
HDx polypeptide having a peptide chain with a molecular weight in the range of 40kd to
60kd, even more preferably in the range of 45-50kd or 53-58kd. It will be understood
that certain post-translational modifications, e.g., phosphorylation and the like, can
increase the apparent molecular weight of the HDx protein relative to the unmodified
polypeptide chain, and cleavage of certain sequences, such as pro-sequences, can likewise
decrease the apparent molecular weight.
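
For orientation, the molecular weight of an unmodified polypeptide chain can be estimated from its amino acid sequence by summing average residue masses and adding one water. The sketch below is illustrative only: the residue masses are standard average values, the 110 Da mean residue mass used for the rough length estimate is a common rule of thumb rather than a figure from this specification, and the function names are hypothetical.

```python
# Standard average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water per chain accounts for the free N- and C-termini


def polypeptide_mass_kd(sequence: str) -> float:
    """Predicted mass in kilodaltons of an unmodified polypeptide chain."""
    total = sum(RESIDUE_MASS[aa] for aa in sequence.upper()) + WATER
    return total / 1000.0


def approx_residue_count(mass_kd: float, mean_residue_da: float = 110.0) -> int:
    """Rough chain length for a given mass, assuming a 110 Da mean residue mass."""
    return round(mass_kd * 1000.0 / mean_residue_da)


# A chain of about 55kd corresponds to roughly 500 residues under this rule of thumb.
print(approx_residue_count(55.0))
```

Post-translational modifications are not modelled, so the result corresponds to the unmodified chain only.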

In other preferred embodiments, the nucleic acid encodes an HDx polypeptide
which includes both the v and x motifs, and preferably possesses a histone deacetylase
activity. For example, preferred HDx proteins are represented by the general formula
A-(v motif)-B-(x motif)-C, wherein the v motif is an amino acid sequence represented in
SEQ ID No. 12, more preferably SEQ ID No. 13, the x motif is an amino acid sequence
represented in SEQ ID No. 14, more preferably SEQ ID No. 15, and A, B and C
represent amino acid sequences which correspond to HDx or HDx-related proteins.

Still other preferred nucleic acids of the present invention encode an HDx
polypeptide which includes a polypeptide sequence corresponding to all or a portion of
amino acid residues of any one of SEQ ID Nos: 5-8, e.g., at least 5, 10, 25, 50 or 100
amino acid residues of that region.

Another aspect of the invention provides a nucleic acid which hybridizes under
high or low stringency conditions to the nucleic acid represented by SEQ ID No: 1.
Appropriate stringency conditions which promote DNA hybridization, for example, 6.0 x
sodium chloride/sodium citrate (SSC) at about 45°C, followed by a wash of 2.0 x SSC at
50°C, are known to those skilled in the art or can be found in Current Protocols in
Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For example, the salt
concentration in the wash step can be selected from a low stringency of about 2.0 x SSC
at 50°C to a high stringency of about 0.2 x SSC at 50°C. In addition, the temperature in
the wash step can be increased from low stringency conditions at room temperature,
about 22°C, to high stringency conditions at about 65°C.
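
The SSC strengths quoted above can be translated into molar salt concentrations with a short calculation. The sketch assumes the conventional composition of 1 x SSC (0.15 M sodium chloride and 0.015 M trisodium citrate), which is standard laboratory practice but is not spelled out in this specification. The function name and dictionary keys are hypothetical.

```python
# Conventional composition of 1 x SSC (assumption, see lead-in).
NACL_M_PER_1X = 0.15
CITRATE_M_PER_1X = 0.015


def ssc_composition(strength: float) -> dict:
    """Molar concentrations for a given SSC strength, e.g. 2.0 for 2.0 x SSC."""
    nacl = strength * NACL_M_PER_1X
    citrate = strength * CITRATE_M_PER_1X
    # Trisodium citrate contributes three sodium ions per molecule.
    return {"nacl_M": nacl, "citrate_M": citrate, "na_ion_M": nacl + 3 * citrate}


# Hybridization and wash conditions named in the paragraph above.
for label, strength, temp_c in [
    ("hybridization", 6.0, 45.0),
    ("low stringency wash", 2.0, 50.0),
    ("high stringency wash", 0.2, 50.0),
]:
    comp = ssc_composition(strength)
    print(f"{label}: {strength} x SSC at {temp_c} C, NaCl {comp['nacl_M']:.3f} M")
```

Lowering the SSC strength from 2.0 x to 0.2 x reduces the salt concentration tenfold, which is what makes the wash more stringent at a fixed temperature.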

Nucleic acids having a sequence that differs from the nucleotide sequences
shown in one of SEQ ID Nos: 1-4 due to degeneracy in the genetic code are also within
the scope of the invention. Such nucleic acids encode functionally equivalent peptides
(i.e., a peptide having a biological activity of an HDx polypeptide) but differ in sequence
from the sequence shown in the sequence listing due to degeneracy in the genetic code.
For example, a number of amino acids are designated by more than one triplet. Codons
that specify the same amino acid, or synonyms (for example, CAU and CAC each encode
histidine) may result in "silent" mutations which do not affect the amino acid sequence of
an HDx polypeptide. However, it is expected that DNA sequence polymorphisms that do
lead to changes in the amino acid sequences of the subject HDx polypeptides will exist
among, for example, humans. One skilled in the art will appreciate that these variations
in one or more nucleotides (up to about 3-5% of the nucleotides) of the nucleic acids
encoding polypeptides having an activity of an HDx polypeptide may exist among
individuals of a given species due to natural allelic variation.
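
The degeneracy described above can be checked mechanically. The following sketch builds the standard genetic code table and tests whether two codons are synonymous, using the CAU/CAC histidine example from the text. It assumes the standard nuclear code, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_codon(codon: str) -> str:
    """One-letter amino acid for a DNA or RNA codon ('*' marks a stop codon)."""
    return CODON_TABLE[codon.upper().replace("U", "T")]


def is_silent(codon_a: str, codon_b: str) -> bool:
    """True when both codons specify the same amino acid."""
    return translate_codon(codon_a) == translate_codon(codon_b)


def synonymous_codons(amino_acid: str) -> list:
    """All RNA codons that specify the given one-letter amino acid."""
    return sorted(c.replace("T", "U") for c, aa in CODON_TABLE.items()
                  if aa == amino_acid)


print(is_silent("CAU", "CAC"))   # both encode histidine
print(synonymous_codons("H"))
```

A substitution such as CAU to CAA changes histidine to glutamine and is therefore not silent.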

As used herein, an HDx gene fragment refers to a nucleic acid having fewer 
nucleotides than the nucleotide sequence encoding the entire mature form of an HDx 
protein yet which (preferably) encodes a polypeptide which retains some biological
activity of the full length protein. Fragment sizes contemplated by the present invention
include, for example, 5, 10, 25, 50, 75, 100, or 200 amino acids in length. 

As indicated by the examples set out below, HDx protein-encoding nucleic acids
can be obtained from mRNA present in any of a number of eukaryotic cells. It should
also be possible to obtain nucleic acids encoding HDx polypeptides of the present
invention from genomic DNA from both adults and embryos. For example, a gene
encoding an HDx protein can be cloned from either a cDNA or a genomic library in
accordance with protocols described herein, as well as those generally known to persons
skilled in the art. A cDNA encoding an HDx protein can be obtained by isolating total
mRNA from a cell, e.g. a mammalian cell, e.g. a human cell, including embryonic cells.
Double stranded cDNAs can then be prepared from the total mRNA and subsequently
inserted into a suitable plasmid or bacteriophage vector using any one of a number of
known techniques. The gene encoding an HDx protein can also be cloned using
established polymerase chain reaction techniques in accordance with the nucleotide
sequence information provided by the invention. The nucleic acid of the invention can be
DNA or RNA. A preferred nucleic acid is a cDNA including a nucleotide sequence
represented by one of SEQ ID Nos: 1-4.

Another aspect of the invention relates to the use of the isolated nucleic acid in 
"antisense" therapy. As used herein, "antisense" therapy refers to administration or in 
situ generation of oligonucleotide probes or their derivatives which specifically hybridize
(e.g., bind) under cellular conditions with the cellular mRNA and/or genomic DNA
encoding one or more of the subject HDx proteins so as to inhibit expression of that 
protein, e.g. by inhibiting transcription and/or translation. The binding may be by 
conventional base pair complementarity, or, for example, in the case of binding to DNA 
duplexes, through specific interactions in the major groove of the double helix. In 
general, "antisense" therapy refers to the range of techniques generally employed in the 
art, and includes any therapy which relies on specific binding to oligonucleotide 
sequences. 

An antisense construct of the present invention can be delivered, for example, as
an expression plasmid which, when transcribed in the cell, produces RNA which is
complementary to at least a unique portion of the cellular mRNA which encodes an HDx
protein. Alternatively, the antisense construct is an oligonucleotide probe which is
generated ex vivo and which, when introduced into the cell, causes inhibition of
expression by hybridizing with the mRNA and/or genomic sequences of an HDx gene.
Such oligonucleotide probes are preferably modified oligonucleotides which are resistant
to endogenous nucleases, e.g. exonucleases and/or endonucleases, and are therefore
stable in vivo. Exemplary nucleic acid molecules for use as antisense oligonucleotides are
phosphoramidate, phosphothioate and methylphosphonate analogs of DNA (see also U.S.
Patents 5,176,996; 5,264,564; and 5,256,775), or peptide nucleic acids (PNAs).
Additionally, general approaches to constructing oligomers useful in antisense therapy
have been reviewed, for example, by Van der Krol et al. (1988) Biotechniques 6:958-976;
and Stein et al. (1988) Cancer Res 48:2659-2668.
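
The base-pair complementarity on which an antisense oligonucleotide relies can be illustrated with a reverse-complement calculation. The sketch below derives an unmodified DNA oligonucleotide, written 5' to 3', that is complementary to a chosen window of an mRNA. The default window length of 15 is borrowed from the "at least 15 consecutive nucleotides" figure in the definition of specific hybridization and is an assumption in this context. The mRNA fragment and function names are hypothetical, and backbone modifications are not modelled.

```python
# Watson-Crick partners for each mRNA base, giving a DNA antisense strand.
COMPLEMENT = {"A": "T", "U": "A", "G": "C", "C": "G"}


def antisense_dna(mrna_region: str) -> str:
    """DNA oligonucleotide (5'->3') complementary to an mRNA region given 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(mrna_region.upper()))


def antisense_for_window(mrna: str, start: int, length: int = 15) -> str:
    """Antisense oligo for mrna[start:start+length]; start is 0-based."""
    region = mrna[start:start + length]
    if len(region) < length:
        raise ValueError("window extends past the end of the mRNA")
    return antisense_dna(region)


# Hypothetical mRNA fragment, for illustration only.
fragment = "AUGGCCAAGUUUCCC"
print(antisense_dna("AUGGCCAAG"))
print(antisense_for_window(fragment, 0))
```

Whether a given window is unique to the target transcript would still have to be checked against other cellular sequences, which this sketch does not attempt.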

Accordingly, the modified oligomers of the invention are useful in therapeutic, 
diagnostic, and research contexts. In therapeutic applications, the oligomers are utilized 
5 in a manner appropriate for antisense therapy in general. For such therapy, the oligomers 
of the invention can be formulated for a variety of routes of administration, including 
systemic and topical or localized administration. Techniques and formulations generally 
may be found in Remmington's Pharmaceutical Sciences, Meade Publishing Co., Easton, 
PA- For systemic administration, injection is preferred, including intramuscular, 
10 intravenous, intraperitoneal, and subcutaneous. For injection, the oligomers of the 
invention can be formulated in liquid solutions, preferably in physiologically compatible 
buffers such as Hank's solution or Ringer's solution. In addition, the oligomers may be 
formulated in solid form and redissolved or suspended immediately prior to use.
Lyophilized forms are also included. 

Systemic administration can also be by transmucosal or transdermal means, or the
compounds can be administered orally. For transmucosal or transdermal administration,
penetrants appropriate to the barrier to be permeated are used in the formulation. Such
penetrants are generally known in the art, and include, for example, for transmucosal
administration bile salts and fusidic acid derivatives. In addition, detergents may be used
to facilitate permeation. Transmucosal administration may be through nasal sprays or
using suppositories. For oral administration, the oligomers are formulated into
conventional oral administration forms such as capsules, tablets, and tonics. For topical 
administration, the oligomers of the invention are formulated into ointments, salves, gels, 
or creams as generally known in the art. 

In addition to use in therapy, the oligomers of the invention may be used as
diagnostic reagents to detect the presence or absence of the target DNA or RNA
sequences to which they specifically bind. Such diagnostic tests are described in further 
detail below. 

Likewise, the antisense constructs of the present invention, by antagonizing the 
normal biological activity of one of the HDx proteins, can be used in the manipulation of
tissue, e.g. tissue differentiation or growth, both in vivo and ex vivo. 

Furthermore, the anti-sense techniques (e.g. microinjection of antisense 
molecules, or transfection with plasmids whose transcripts are anti-sense with regard to 
an HDx mRNA or gene sequence) can be used to investigate the role of HDx in
developmental events, as well as the normal cellular function of HDx in adult tissue.
Such techniques can be utilized in cell culture, but can also be used in the creation of 
transgenic animals (described infra). 

This invention also provides expression vectors containing a nucleic acid 
encoding an HDx polypeptide, operably linked to at least one transcriptional regulatory 
sequence. Operably linked is intended to mean that the nucleotide sequence is linked to a 
regulatory sequence in a manner which allows expression of the nucleotide sequence. 
Regulatory sequences are art-recognized and are selected to direct expression of the 
subject HDx proteins. Accordingly, the term transcriptional regulatory sequence includes 
promoters, enhancers and other expression control elements. Such regulatory sequences 
are described in Goeddel, Gene Expression Technology: Methods in Enzymology 185,
Academic Press, San Diego, CA (1990). For instance, any of a wide variety of
expression control sequences, sequences that control the expression of a DNA sequence 
when operatively linked to it, may be used in these vectors to express DNA sequences 
encoding HDx polypeptides of this invention. Such useful expression control sequences
include, for example, a viral LTR, such as the LTR of the Moloney murine leukemia
virus, the early and late promoters of SV40, adenovirus or cytomegalovirus immediate
early promoter, the lac system, the trp system, the TAC or TRC system, T7 promoter
whose expression is directed by T7 RNA polymerase, the major operator and promoter
regions of phage λ, the control regions for fd coat protein, the promoter for
3-phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase,
e.g., Pho5, the promoters of the yeast α-mating factors, the polyhedron promoter of the
baculovirus system and other sequences known to control the expression of genes of
prokaryotic or eukaryotic cells or their viruses, and various combinations thereof. It
should be understood that the design of the expression vector may depend on such 
factors as the choice of the host cell to be transformed and/or the type of protein desired 
to be expressed. Moreover, the vector's copy number, the ability to control that copy 
number and the expression of any other proteins encoded by the vector, such as antibiotic 
markers, should also be considered. In one embodiment, the expression vector includes a
recombinant gene encoding a peptide having an agonistic activity of a subject HDx 
polypeptide, or alternatively, encoding a peptide which is an antagonistic form of the 
HDx protein, such as a catalytically-inactive deacetylase. Such expression vectors can be 
used to transfect cells and thereby produce polypeptides, including fusion proteins,
encoded by nucleic acids as described herein. 

Moreover, the gene constructs of the present invention can also be used as a part
of a gene therapy protocol to deliver nucleic acids, e.g., encoding either an agonistic or 
antagonistic form of one of the subject HDx proteins or an antisense molecule described 
above. Thus, another aspect of the invention features expression vectors for in vivo or in 
vitro transfection and expression of an HDx polypeptide or antisense molecule in
particular cell types so as to reconstitute the function of, or alternatively, abrogate the
function of HDx-induced transcription in a tissue in which the naturally-occurring form of
the protein is misexpressed; or to deliver a form of the protein which alters differentiation 
of tissue, or which inhibits neoplastic transformation. 

Expression constructs of the subject HDx polypeptides, as well as antisense
constructs, may be administered in any biologically effective carrier, e.g. any formulation
or composition capable of effectively delivering the recombinant gene to cells in vivo.
Approaches include insertion of the subject gene in viral vectors including recombinant
retroviruses, adenovirus, adeno-associated virus, and herpes simplex virus-1, or
recombinant bacterial or eukaryotic plasmids. Viral vectors transfect cells directly;
plasmid DNA can be delivered with the help of, for example, cationic liposomes
(lipofectin) or derivatized (e.g. antibody conjugated) polylysine conjugates, gramicidin
S, artificial viral envelopes or other such intracellular carriers, as well as direct injection
of the gene construct or CaPO4 precipitation carried out in vivo. It will be appreciated
that because transduction of appropriate target cells represents the critical first step in
gene therapy, choice of the particular gene delivery system will depend on such factors as
the phenotype of the intended target and the route of administration, e.g. locally or
systemically. Furthermore, it will be recognized that the particular gene constructs
provided for in vivo transduction of HDx expression are also useful for in vitro
transduction of cells, such as for use in the ex vivo tissue culture systems described
below.

A preferred approach for in vivo introduction of nucleic acid into a cell is by use 
of a viral vector containing nucleic acid, e.g. a cDNA encoding the particular HDx 
polypeptide desired. Infection of cells with a viral vector has the advantage that a large 
proportion of the targeted cells can receive the nucleic acid. Additionally, molecules
encoded within the viral vector, e.g., by a cDNA contained in the viral vector, are 
expressed efficiently in cells which have taken up viral vector nucleic acid. Retrovirus 
vectors, adenovirus vectors and adeno-associated virus vectors are exemplary
recombinant gene delivery systems for the transfer of exogenous genes in vivo,
particularly into humans. These vectors provide efficient delivery of genes into cells, and
the transferred nucleic acids are stably integrated into the chromosomal DNA of the host. 

In addition to viral transfer methods, such as those illustrated above, non-viral 
methods can also be employed to cause expression of a subject HDx polypeptide in the 
tissue of an animal. Most nonviral methods of gene transfer rely on normal mechanisms
used by mammalian cells for the uptake and intracellular transport of macromolecules. In
preferred embodiments, non-viral gene delivery systems of the present invention rely on 
endocytic pathways for the uptake of the subject HDx polypeptide gene by the targeted 
cell. Exemplary gene delivery systems of this type include liposomal derived systems, 
poly-lysine conjugates, and artificial viral envelopes.

In clinical settings, the gene delivery systems for the therapeutic HDx gene can be 
introduced into a patient by any of a number of methods, each of which is familiar in the 
art. For instance, a pharmaceutical preparation of the gene delivery system can be 
introduced systemically, e.g. by intravenous injection, and specific transduction of the
protein in the target cells occurs predominantly from specificity of transfection provided
by the gene delivery vehicle, cell-type or tissue-type expression due to the transcriptional
regulatory sequences controlling expression of the receptor gene, or a combination
thereof. In other embodiments, initial delivery of the recombinant gene is more limited
with introduction into the animal being quite localized. For example, the gene delivery
vehicle can be introduced by catheter (see U.S. Patent 5,328,470) or by stereotactic
injection (e.g. Chen et al. (1994) PNAS 91:3054-3057). An HDx gene, such as any one
of the clones represented in the group consisting of SEQ ID NO: 1-4, can be delivered in
a gene therapy construct by electroporation using techniques described, for example, by
Dev et al. ((1994) Cancer Treat Rev 20:105-115).

The pharmaceutical preparation of the gene therapy construct can consist
essentially of the gene delivery system in an acceptable diluent, or can comprise a slow
release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the
complete gene delivery system can be produced intact from recombinant cells, e.g.
retroviral vectors, the pharmaceutical preparation can comprise one or more cells which
produce the gene delivery system.

Another aspect of the present invention concerns recombinant forms of the HDx 
proteins. Recombinant polypeptides preferred by the present invention, in addition to 
native HDx proteins, are at least 80% homologous, more preferably at least 85% 
homologous and most preferably at least 88% homologous with an amino acid sequence 
represented by any of SEQ ID Nos: 5-8. Polypeptides which possess an activity of an
HDx protein (i.e. either agonistic or antagonistic), and which are at least 90%, more
preferably at least 95%, and most preferably at least about 98-99% homologous with a
sequence selected from the group consisting of SEQ ID Nos: 5-8 are also within the
scope of the invention. In other preferred embodiments, the HDx polypeptide includes
both the v and x motifs, and preferably possesses a histone deacetylase activity.
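
As a rough illustration of how the 80%, 85% and 88% homology thresholds recited above might be applied, the following sketch scores two pre-aligned sequences. It assumes, purely for the purposes of the example, that "homologous" is measured as percent identity over aligned, non-gap positions; the function names and the toy sequences are hypothetical and are not drawn from SEQ ID Nos: 5-8.

```python
# Illustrative sketch only. Assumption: homology is approximated here as percent
# identity over aligned positions where neither sequence has a gap ("-").
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(a == b for a, b in pairs)
    return 100.0 * matches / len(pairs)


def preference_tier(pct: float) -> str:
    """Map a score onto the 80% / 85% / 88% tiers recited in the text."""
    if pct >= 88.0:
        return "most preferred"
    if pct >= 85.0:
        return "more preferred"
    if pct >= 80.0:
        return "preferred"
    return "below threshold"


# Toy ten-residue sequences differing at one position (made up for demonstration).
score = percent_identity("ACDEFGHIKL", "ACDEFGHIKV")
print(score, preference_tier(score))
```

Other scoring schemes (for example, ones that credit conservative substitutions) would give different values; the choice of identity here is a simplification.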

The term "recombinant HDx protein" refers to a polypeptide which is produced 
by recombinant DNA techniques, wherein generally, DNA encoding an HDx polypeptide
is inserted into a suitable expression vector which is in turn used to transform a host cell
to produce the heterologous protein. Moreover, the phrase "derived from", with respect 
to a recombinant HDx gene, is meant to include within the meaning of "recombinant 
protein" those proteins having an amino acid sequence of a native HDx protein, or an 
amino acid sequence similar thereto which is generated by mutations including 
substitutions and deletions (including truncation) of a naturally occurring form of the 
protein.

The present invention further pertains to recombinant forms of the subject HDx 
polypeptides which are encoded by genes derived from a mammal (e.g. a human), and 
which have amino acid sequences evolutionarily related to the HDx proteins represented 
in SEQ ID Nos: 5-8. Such recombinant HDx polypeptides preferably are capable of 
functioning as either an agonist or an antagonist of at least one biological
activity of a wild-type ("authentic") HDx protein of the appended sequence listing. The 
term "evolutionarily related to", with respect to amino acid sequences of HDx proteins, 
refers to both polypeptides having amino acid sequences which have arisen naturally, and 
also to mutational variants of HDx polypeptides which are derived, for example, by 
combinatorial mutagenesis.

The present invention also provides methods of producing the subject HDx 
polypeptides. For example, a host cell transfected with a nucleic acid vector directing
expression of a nucleotide sequence encoding the subject polypeptides can be cultured 
under appropriate conditions to allow expression of the peptide to occur. The cells may 
be harvested, lysed and the protein isolated. A cell culture includes host cells, media and 
other byproducts. Suitable media for cell culture are well known in the art. The 
recombinant HDx polypeptide can be isolated from cell culture medium, host cells, or 
both using techniques known in the art for purifying proteins including ion-exchange 
chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, and 
immunoaffinity purification with antibodies specific for such peptide. In a preferred
embodiment, the recombinant HDx polypeptide is a fusion protein containing a domain
which facilitates its purification, such as a GST fusion protein or a poly(His) fusion protein.

This invention also pertains to a host cell transfected to express recombinant 
forms of the subject HDx polypeptides. The host cell may be any prokaryotic or 
eukaryotic cell. Thus, a nucleotide sequence derived from the cloning of HDx proteins, 
encoding all or a selected portion of a full-length protein, can be used to produce a
recombinant form of an HDx polypeptide via microbial or eukaryotic cellular processes. 
Ligating the polynucleotide sequence into a gene construct, such as an expression vector, 
and transforming or transfecting into hosts, either eukaryotic (yeast, avian, insect or 
mammalian) or prokaryotic (bacterial cells), are standard procedures used in producing 
other well-known proteins, e.g. MAP kinases, p53, WT1, PTP phosphatases, SRC, and
the like. Similar procedures, or modifications thereof, can be employed to prepare 
recombinant HDx polypeptides by microbial means or tissue-culture technology in accord 
with the subject invention. 

The recombinant HDx genes can be produced by ligating nucleic acid encoding an 
HDx protein, or a portion thereof, into a vector suitable for expression in either 
prokaryotic cells, eukaryotic cells, or both. Expression vectors for production of 
recombinant forms of the subject HDx polypeptides include plasmids and other vectors.
For instance, suitable vectors for the expression of an HDx polypeptide include plasmids
of the types: pBR322-derived plasmids, pEMBL-derived plasmids, pEX-derived 
plasmids, pBTac-derived plasmids and pUC-derived plasmids for expression in 
prokaryotic cells, such as E. coli. 

A number of vectors exist for the expression of recombinant proteins in yeast. 
For instance, YEP24, YIP5, YEP51, YEP52, pYES2, and YRP17 are cloning and
expression vehicles useful in the introduction of genetic constructs into S. cerevisiae (see,
for example, Broach et al. (1983) in Experimental Manipulation of Gene Expression,
ed. M. Inouye, Academic Press, p. 83, incorporated by reference herein). These vectors
can replicate in E. coli due to the presence of the pBR322 ori, and in S. cerevisiae due to
the replication determinant of the yeast 2 micron plasmid. In addition, drug resistance
markers such as ampicillin can be used. In an illustrative embodiment, an HDx
polypeptide is produced recombinantly utilizing an expression vector generated by
subcloning the coding sequence of one of the HDx genes represented in SEQ ID Nos: 1-4.

The preferred mammalian expression vectors contain both prokaryotic sequences, 
to facilitate the propagation of the vector in bacteria, and one or more eukaryotic 
transcription units that are expressed in eukaryotic cells. The pcDNAI/amp,
pcDNAI/neo, pRc/CMV, pSV2gpt, pSV2neo, pSV2-dhfr, pTk2, pRSVneo, pMSG, 
pSVT7, pko-neo and pHyg derived vectors are examples of mammalian expression 
vectors suitable for transfection of eukaryotic cells. Some of these vectors are modified 
with sequences from bacterial plasmids, such as pBR322, to facilitate replication and
drug resistance selection in both prokaryotic and eukaryotic cells. Alternatively,
derivatives of viruses such as the bovine papillomavirus (BPV-1), or Epstein-Barr virus 
(pHEBo, pREP-derived and p205) can be used for transient expression of proteins in 
eukaryotic cells. The various methods employed in the preparation of the plasmids and 
transformation of host organisms are well known in the art. For other suitable expression
systems for both prokaryotic and eukaryotic cells, as well as general recombinant
procedures, see Molecular Cloning: A Laboratory Manual, 2nd Ed., ed. by Sambrook,
Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989) Chapters 16 and 17.

In some instances, it may be desirable to express the recombinant HDx 
polypeptide by the use of a baculovirus expression system. Examples of such baculovirus
expression systems include pVL-derived vectors (such as pVL1392, pVL1393 and
pVL941), pAcUW-derived vectors (such as pAcUW1), and pBlueBac-derived vectors
(such as the β-gal containing pBlueBac III).

When it is desirable to express only a portion of an HDx protein, such as a form 
lacking a portion of the N-terminus, i.e. a truncation mutant which lacks the signal
peptide, it may be necessary to add a start codon (ATG) to the oligonucleotide fragment 
containing the desired sequence to be expressed. It is well known in the art that a 
methionine at the N-terminal position can be enzymatically cleaved by the use of the 
enzyme methionine aminopeptidase (MAP). MAP has been cloned from E. coli
(Ben-Bassat et al. (1987) J. Bacteriol. 169:751-757) and Salmonella typhimurium and its in
vitro activity has been demonstrated on recombinant proteins (Miller et al. (1987) PNAS
84:2718-2722). Therefore, removal of an N-terminal methionine, if desired, can be
achieved either in vivo by expressing HDx-derived polypeptides in a host which produces
MAP (e.g., E. coli or CM89 or S. cerevisiae), or in vitro by use of purified MAP (e.g.,
procedure of Miller et al., supra).

Alternatively, the coding sequences for the polypeptide can be incorporated as a 
part of a fusion gene including a nucleotide sequence encoding a different polypeptide. 
This type of expression system can be useful under conditions where it is desirable to 
produce an immunogenic fragment of an HDx protein. For example, the VP6 capsid 
protein of rotavirus can be used as an immunologic carrier protein for portions of the
HDx polypeptide, either in the monomeric form or in the form of a viral particle. The
nucleic acid sequences corresponding to the portion of a subject HDx protein to which 
antibodies are to be raised can be incorporated into a fusion gene construct which 
includes coding sequences for a late vaccinia virus structural protein to produce a set of 
recombinant viruses expressing fusion proteins comprising HDx epitopes as part of the 
virion. It has been demonstrated with the use of immunogenic fusion proteins utilizing 
the Hepatitis B surface antigen fusion proteins that recombinant Hepatitis B virions can 
be utilized in this role as well. Similarly, chimeric constructs coding for fusion proteins 
containing a portion of an HDx protein and the poliovirus capsid protein can be created
to enhance immunogenicity of the set of polypeptide antigens (see, for example, EP 
Publication No: 0259149; and Evans et al. (1989) Nature 339:385; Huang et al. (1988) 
J. Virol. 62:3855; and Schlienger et al. (1992) J. Virol. 66:2). 

The Multiple Antigen Peptide system for peptide-based immunization can also be 
utilized to generate an immunogen, wherein a desired portion of an HDx polypeptide is 
obtained directly from organo-chemical synthesis of the peptide onto an oligomeric
branching lysine core (see, for example, Posnett et al. (1988) JBC 263:1719 and Nardelli 
et al. (1992) J. Immunol. 148:914). Antigenic determinants of HDx proteins can also be 
expressed and presented by bacterial cells. 

In addition to utilizing fusion proteins to enhance immunogenicity, it is widely 
appreciated that fusion proteins can also facilitate the expression of proteins, and 
accordingly, can be used in the expression of the HDx polypeptides of the present 
invention. For example, HDx polypeptides can be generated as glutathione-S-transferase
(GST) fusion proteins. Such GST fusion proteins can enable easy purification of the
HDx polypeptide, as for example by the use of glutathione-derivatized matrices (see, for
example, Current Protocols in Molecular Biology, eds. Ausubel et al. (N.Y.: John Wiley
& Sons, 1991)). 

In another embodiment, a fusion gene coding for a purification leader sequence, 
such as a poly-(His)/enterokinase cleavage site sequence at the N-terminus of the desired 
portion of the recombinant protein, can allow purification of the expressed fusion protein 
by affinity chromatography using a Ni2+ metal resin. The purification leader sequence 
can then be subsequently removed by treatment with enterokinase to provide the purified 
protein (e.g., see Hochuli et al. (1987) J. Chromatography 411:177; and Janknecht et al.
PNAS 88:8972).

Techniques for making fusion genes are known to those skilled in the art.
Essentially, the joining of various DNA fragments coding for different polypeptide 
sequences is performed in accordance with conventional techniques, employing
blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase 
treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment, 
the fusion gene can be synthesized by conventional techniques including automated DNA 
synthesizers. Alternatively, PCR amplification of gene fragments can be carried out using
anchor primers which give rise to complementary overhangs between two consecutive 
gene fragments which can subsequently be annealed to generate a chimeric gene sequence 
(see, for example, Current Protocols in Molecular Biology, eds. Ausubel et al. John 
Wiley & Sons: 1992). 

HDx polypeptides may also be chemically modified to create HDx derivatives by 
forming covalent or aggregate conjugates with other chemical moieties, such as glycosyl 
groups, lipids, phosphate, acetyl groups and the like. Covalent derivatives of HDx 
proteins can be prepared by linking the chemical moieties to functional groups on amino 
acid sidechains of the protein or at the N-terminus or at the C-terminus of the 
polypeptide. 

The present invention also makes available isolated HDx polypeptides which are 
isolated from, or otherwise substantially free of other cellular proteins, especially other 
signal transduction factors and/or transcription factors which may normally be associated 
with the HDx polypeptide. The term "substantially free of other cellular proteins" (also 
referred to herein as "contaminating proteins") or "substantially pure or purified 
preparations" are defined as encompassing preparations of HDx polypeptides having less
than 20% (by dry weight) contaminating protein, and preferably having less than 5% 
contaminating protein. Functional forms of the subject polypeptides can be prepared, for 
the first time, as purified preparations by using a cloned gene as described herein. By
"purified", it is meant, when referring to a peptide or DNA or RNA sequence, that the
indicated molecule is present in the substantial absence of other biological 
macromolecules, such as other proteins. The term "purified" as used herein preferably 
means at least 80% by dry weight, more preferably in the range of 95-99% by weight, 
and most preferably at least 99.8% by weight, of biological macromolecules of the same 
type present (but water, buffers, and other small molecules, especially molecules having a 
molecular weight of less than 5000, can be present). The term "pure" as used herein 
preferably has the same numerical limits as "purified" immediately above. "Isolated" and 
"purified" do not encompass either natural materials in their native state or natural
materials that have been separated into components (e.g., in an acrylamide gel) but not
obtained either as pure (e.g. lacking contaminating proteins, or chromatography reagents
such as denaturing agents and polymers, e.g. acrylamide or agarose) substances or
solutions. In preferred embodiments, purified HDx preparations will lack any
contaminating proteins from the same animal from which HDx is normally produced, as can
be accomplished by recombinant expression of, for example, a human HDx protein in a 
non-human cell. 

As described above for recombinant polypeptides, isolated HDx polypeptides can 
include all or a portion of an amino acid sequence corresponding to an HDx polypeptide
represented in any one of SEQ ID Nos: 5-8 or homologous sequences thereto. In 
preferred embodiments, the HDx polypeptide includes both the v and x motifs, and 
preferably possesses a histone deacetylase activity.

Isolated peptidyl portions of HDx proteins can be obtained by screening peptides 
recombinantly produced from the corresponding fragment of the nucleic acid encoding
such peptides. In addition, fragments can be chemically synthesized using techniques
known in the art such as conventional Merrifield solid phase f-Moc or t-Boc chemistry.
For example, an HDx polypeptide of the present invention may be arbitrarily divided into
fragments of desired length with no overlap of the fragments, or preferably divided into
overlapping fragments of a desired length. The fragments can be produced
(recombinantly or by chemical synthesis) and tested to identify those peptidyl fragments
which can function as either agonists or antagonists of a wild-type (e.g., "authentic")
HDx protein. 
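
The division of a polypeptide into non-overlapping or overlapping fragments described above can be sketched, by way of example only, as a sliding window over the amino acid sequence. The function name fragments, the choice of window and step sizes, and the toy sequence are all hypothetical; when the step does not land exactly on the C-terminus, this sketch adds one final window flush with the end so that no residues are omitted.

```python
# Illustrative sketch only: split a polypeptide into fragments of a desired length.
# step == length gives non-overlapping fragments; step < length gives overlapping ones.
from typing import List, Optional


def fragments(sequence: str, length: int, step: Optional[int] = None) -> List[str]:
    step = length if step is None else step
    if length <= 0 or step <= 0:
        raise ValueError("length and step must be positive")
    if len(sequence) <= length:
        return [sequence]
    starts = list(range(0, len(sequence) - length + 1, step))
    if starts[-1] + length < len(sequence):
        # Add a final window ending at the C-terminus (it may overlap the previous one).
        starts.append(len(sequence) - length)
    return [sequence[s:s + length] for s in starts]


toy_peptide = "MAQTQGTRRK"  # made-up ten-residue sequence for demonstration
non_overlapping = fragments(toy_peptide, 5)
overlapping = fragments(toy_peptide, 5, step=2)
print(non_overlapping)
print(overlapping)
```

Each resulting fragment would then be produced and assayed separately, as set out in the preceding paragraph.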

The recombinant HDx polypeptides of the present invention also include 
homologs of the authentic HDx proteins, such as versions of those proteins which are
resistant to proteolytic cleavage, as for example, due to mutations which alter 
ubiquitination or other enzymatic targeting associated with the protein. 

Modification of the structure of the subject HDx polypeptides can be for such 
purposes as enhancing therapeutic or prophylactic efficacy, stability (e.g., ex vivo shelf 
life and resistance to proteolytic degradation in vivo), or post-translational modifications 
(e.g., to alter phosphorylation pattern of protein). Such modified peptides, when 
designed to retain at least one activity of the naturally-occurring form of the protein, or 
to produce specific antagonists thereof, are considered functional equivalents of the HDx
polypeptides described in more detail herein. Such modified peptides can be produced,
for instance, by amino acid substitution, deletion, or addition. 

For example, it is reasonable to expect that an isolated replacement of a leucine 
with an isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a 
similar replacement of an amino acid with a structurally related amino acid (i.e. isosteric 
and/or isoelectric mutations) will not have a major effect on the biological activity of the 
resulting molecule. Conservative replacements are those that take place within a family of 
amino acids that are related in their side chains. Genetically encoded amino acids can
be divided into four families: (1) acidic = aspartate, glutamate; (2) basic = lysine,
arginine, histidine; (3) nonpolar = alanine, valine, leucine, isoleucine, proline,
phenylalanine, methionine, tryptophan; and (4) uncharged polar = glycine, asparagine,
glutamine, cysteine, serine, threonine, tyrosine. Phenylalanine, tryptophan, and tyrosine
are sometimes classified jointly as aromatic amino acids. In similar fashion, the amino
acid repertoire can be grouped as (1) acidic = aspartate, glutamate; (2) basic = lysine,
arginine, histidine; (3) aliphatic = glycine, alanine, valine, leucine, isoleucine, serine,
threonine, with serine and threonine optionally being grouped separately as
aliphatic-hydroxyl; (4) aromatic = phenylalanine, tyrosine, tryptophan; (5) amide = asparagine,
glutamine; and (6) sulfur-containing = cysteine and methionine (see, for example,
Biochemistry, 2nd ed., Ed. by L. Stryer, WH Freeman and Co.: 1981). Whether a
change in the amino acid sequence of a peptide results in a functional HDx homolog (e.g.
functional in the sense that the resulting polypeptide mimics or antagonizes the wild-type
form) can be readily determined by assessing the ability of the variant peptide to produce
a response in cells in a fashion similar to the wild-type protein, or competitively inhibit
such a response. Polypeptides in which more than one replacement has taken place can
readily be tested in the same manner.
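
For illustration, the four-family grouping recited above can be encoded as a simple lookup, with a replacement treated as conservative when both residues fall in the same family. The one-letter codes are the standard amino acid abbreviations; the names FOUR_FAMILIES and is_conservative are hypothetical. The sketch classifies substitutions only and says nothing about whether a given variant retains activity, which must still be determined by assay as described.

```python
# Illustrative sketch only: the four side-chain families listed in the text,
# written with standard one-letter amino acid codes.
FOUR_FAMILIES = {
    "acidic": "DE",                # aspartate, glutamate
    "basic": "KRH",                # lysine, arginine, histidine
    "nonpolar": "AVLIPFMW",        # Ala, Val, Leu, Ile, Pro, Phe, Met, Trp
    "uncharged polar": "GNQCSTY",  # Gly, Asn, Gln, Cys, Ser, Thr, Tyr
}

# Reverse lookup: residue -> family name.
FAMILY_OF = {
    residue: family
    for family, members in FOUR_FAMILIES.items()
    for residue in members
}


def is_conservative(original: str, replacement: str) -> bool:
    """True when both residues belong to the same family in the four-family scheme."""
    return FAMILY_OF[original] == FAMILY_OF[replacement]


print(is_conservative("L", "I"))  # leucine -> isoleucine
print(is_conservative("D", "E"))  # aspartate -> glutamate
print(is_conservative("L", "D"))  # leucine -> aspartate
```

The alternative six-group scheme could be substituted by swapping in a different dictionary; the two schemes disagree for some pairs (glycine, for example, is uncharged polar in the first and aliphatic in the second).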

This invention further contemplates a method for generating sets of combinatorial 
mutants of the subject HDx proteins as well as truncation mutants, and is especially 
useful for identifying potential variant sequences (e.g. homologs) that are functional in 
modulating histone deacetylation. The purpose of screening such combinatorial libraries 
is to generate, for example, novel HDx homologs which can act as either agonists or 
antagonists, or alternatively, possess novel activities altogether. To illustrate, HDx
homologs can be engineered by the present method to provide selective, constitutive 
activation of enzymatic activity. Thus, combinatorially-derived homologs can be 
generated to have an increased potency relative to a naturally occurring form of the 
protein.

Likewise, HDx homologs can be generated by the present combinatorial approach
to selectively inhibit (antagonize) histone deacetylation. For instance, mutagenesis can 
provide HDx homologs which are able to bind other regulatory proteins or cytoskeletal 
elements (or DNA) yet prevent acetylation of histones, e.g. the homologs can be 
dominant negative mutants. In a preferred embodiment, a dominant negative mutant of
an HDx protein is mutated at one or more residues of its catalytic site and/or specificity 
subsites. 

In one aspect of this method, the amino acid sequences for a population of HDx 
homologs or other related proteins are aligned, preferably to promote the highest 
homology possible. Such a population of variants can include, for example, HDx
homologs from one or more species. Amino acids which appear at each position of the 
aligned sequences are selected to create a degenerate set of combinatorial sequences. In 
a preferred embodiment, the variegated library of HDx variants is generated by 
combinatorial mutagenesis at the nucleic acid level, and is encoded by a variegated gene 
library. For instance, a mixture of synthetic oligonucleotides can be enzymatically ligated 
into gene sequences such that the degenerate set of potential HDx sequences are 
expressible as individual polypeptides, or alternatively, as a set of larger fusion proteins 
(e.g. for phage display) containing the set of HDx sequences therein. 

As illustrated in Figure 5B, to analyze the sequences of a population of variants, 
the amino acid sequences of interest can be aligned relative to sequence homology. The 
presence or absence of amino acids from an aligned sequence of a particular variant is 
relative to a chosen consensus length of a reference sequence, which can be real or 
artificial. For instance, Figure 5B includes the alignment of the v and x-motifs for several 
of the HDx gene products. Analysis of the alignment of these sequences from the HDx 
clones can give rise to the generation of a degenerate library of polypeptides comprising 
potential HDx sequences. In an exemplary embodiment, a library of variants based on 
the HD1 sequence, but degenerate across each of the v and x-motifs can be provided. 
One such library can be represented by the general formula A-(v motif)-B-(x motif)-C, 
wherein the v motif is an amino acid sequence represented in the general formula 

DIAX1NWAGGLHHAKKX2EASGFCYVNDIVX3X4ILELLKYHX5RVLYIDIDIHHGDGX6EEAFYX7TDRVMTVSF 

the x motif is an amino acid sequence represented in the general formula 

CVEX8VKX9FNX10PLLX11LGGGGYTX12RNVARCWTYET 

A corresponds to Met1-Thr129 of SEQ ID No. 5, B corresponds to His199-Lys283 of 
SEQ ID No. 5, and C corresponds to Ala317-Ala482 of SEQ ID No. 5, wherein X1 
represents Ile or Val; X2 represents Phe or Ser; X3 represents Phe or Leu; X4 represents 
Gly or Ala; X5 represents Pro or Gln; X6 represents Gln or Glu; X7 represents Leu or 
Thr; X8 represents Val or Tyr; X9 represents Thr or Ser; X10 represents Leu or Ile; X11 
represents Met or Val; and X12 represents Ile or Val. To further expand the 
combinatorial set, other conservative mutations relative to those appearing in the human 
sequences can be provided. For example, in a more expansive library, X1 represents Gly, 
Ala, Val, Ile or Leu; X2 represents Phe, Tyr, Thr or Ser; X3 represents Phe, Tyr, Gly, 
Ala, Val, Ile or Leu; X4 represents Gly, Ala, Val, Ile or Leu; X5 represents Pro, Asn or 
Gln; X6 represents Asn, Gln, Asp or Glu; X7 represents Gly, Ala, Val, Ile, Leu, Ser or 
Thr; X8 represents Gly, Ala, Val, Ile, Leu, Phe or Tyr; X9 represents Thr, Cys, or Ser; 
X10 represents Gly, Ala, Val, Ile or Leu; X11 represents Met, Cys, Gly, Ala, Val, Ile, 
Leu, Ser or Thr; and X12 represents Gly, Ala, Val, Ile or Leu. In still another library, 
each degenerate position can be any one of the naturally occurring amino acids. 
Likewise, the v and x-motifs can correspond to the degenerate sequences designated by 
SEQ ID Nos. 12 and 14, respectively. 
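
By way of illustration only, the size of such a degenerate set can be estimated, and the x motif variants listed, with a short Python sketch. The one-letter residue codes, the variable names and the function `enumerate_motif` are assumptions introduced for this example and form no part of the described method; the residue choices are those recited above for the narrower library. 

```python
from itertools import product
from math import prod

# x motif from the general formula; degenerate positions are named placeholders.
X_MOTIF = "CVE{X8}VK{X9}FN{X10}PLL{X11}LGGGGYT{X12}RNVARCWTYET"

# Narrower library: the two residues recited for each degenerate position
# (one-letter codes; this encoding is an assumption of the sketch).
NARROW = {
    "X1": "IV", "X2": "FS", "X3": "FL", "X4": "GA", "X5": "PQ", "X6": "QE",
    "X7": "LT", "X8": "VY", "X9": "TS", "X10": "LI", "X11": "MV", "X12": "IV",
}

def enumerate_motif(template, choices):
    """List every concrete sequence for the placeholders present in template."""
    names = [name for name in choices if "{" + name + "}" in template]
    variants = []
    for combo in product(*(choices[name] for name in names)):
        variants.append(template.format(**dict(zip(names, combo))))
    return variants

# All x motif variants (positions X8 through X12 only).
x_variants = enumerate_motif(X_MOTIF, NARROW)

# Number of distinct polypeptides when all twelve positions vary independently.
library_size = prod(len(residues) for residues in NARROW.values())
```

Substituting the residue sets of the more expansive library into `NARROW` gives the corresponding count for that library. 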

There are many ways by which such libraries of potential HDx homologs can be 
generated from a degenerate oligonucleotide sequence. Chemical synthesis of a 
degenerate gene sequence can be carried out in an automatic DNA synthesizer, and the 
synthetic genes then ligated into an appropriate expression vector. The purpose of a 
degenerate set of genes is to provide, in one mixture, all of the sequences encoding the 
desired set of potential HDx sequences. The synthesis of degenerate oligonucleotides is 
well known in the art (see for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et 
al. (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed AG 
Walton, Amsterdam: Elsevier pp273-289; Itakura et al. (1984) Annu. Rev. Biochem 
53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 
11:477). Such techniques have been employed in the directed evolution of other proteins 
(see, for example, Scott et al. (1990) Science 249:386-390; Roberts et al. (1992) PNAS 
89:2429-2433; Devlin et al. (1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 
6378-6382; as well as U.S. Patents Nos. 5,223,409, 5,198,346, and 5,096,815). 
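
To make concrete how a single degenerate oligonucleotide provides, in one mixture, several coding sequences, the following Python sketch expands a degenerate sequence written in the standard IUPAC nucleotide ambiguity codes. The function name `expand_degenerate` and the example codon are illustrative assumptions; the codon RTT (R being A or G) is chosen because ATT encodes Ile and GTT encodes Val, matching a position recited as "Ile or Val". 

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

def expand_degenerate(oligo):
    """Return every concrete sequence encoded by a degenerate oligonucleotide."""
    pools = (IUPAC[base] for base in oligo.upper())
    return ["".join(bases) for bases in product(*pools)]

# One degenerate codon covering an "Ile or Val" position.
ile_or_val = expand_degenerate("RTT")
```

The number of sequences grows as the product of the ambiguity at each base, so a fully randomized NNK codon, for instance, expands to 32 codons. 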

Likewise, a library of coding sequence fragments can be provided for an HDx 
clone in order to generate a variegated population of HDx fragments for screening and 
subsequent selection of bioactive fragments. A variety of techniques are known in the art 
for generating such libraries, including chemical synthesis. In one embodiment, a library 
of coding sequence fragments can be generated by (i) treating a double stranded PCR 
fragment of an HDx coding sequence with a nuclease under conditions wherein nicking 
occurs only about once per molecule; (ii) denaturing the double stranded DNA; (iii) 
renaturing the DNA to form double stranded DNA which can include sense/antisense 
pairs from different nicked products; (iv) removing single stranded portions from 
reformed duplexes by treatment with S1 nuclease; and (v) ligating the resulting fragment 
library into an expression vector. By this exemplary method, an expression library can be 
derived which codes for N-terminal, C-terminal and internal fragments of various sizes. 

A wide range of techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a certain property. Such techniques will be generally 
adaptable for rapid screening of the gene libraries generated by the combinatorial 
mutagenesis of HDx homologs. The most widely used techniques for screening large 
gene libraries typically comprise cloning the gene library into replicable expression 
vectors, transforming appropriate cells with the resulting library of vectors, and 
expressing the combinatorial genes under conditions in which detection of a desired 
activity facilitates relatively easy isolation of the vector encoding the gene whose product 
was detected. 

In an exemplary embodiment, the library of HDx variants is expressed as a fusion 
protein on the surface of a viral particle. For instance, in the filamentous phage system, 
foreign peptide sequences can be expressed on the surface of infectious phage, thereby 
conferring two significant benefits. First, since these phage can be applied to affinity 
matrices at very high concentrations, a large number of phage can be screened at one 
time. Second, since each infectious phage displays the combinatorial gene product on its 
surface, if a particular phage is recovered from an affinity matrix in low yield, the phage 
can be amplified by another round of infection. The group of almost identical E. coli 
filamentous phages M13, fd, and f1 are most often used in phage display libraries, as 
either of the phage gIII or gVIII coat proteins can be used to generate fusion proteins 
without disrupting the ultimate packaging of the viral particle (Ladner et al. PCT 
publication WO 90/02909; Garrard et al., PCT publication WO 92/09690; Marks et al. 
(1992) J. Biol. Chem. 267:16007-16010; Griffiths et al. (1993) EMBO J 12:725-734; 
Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) PNAS 89:4457- 
4461). 

For example, the recombinant phage antibody system (RPAS, Pharmacia Catalog 
number 27-9400-01) can be easily modified for use in expressing and screening HDx 
combinatorial libraries by panning on glutathione immobilized histones/GST fusion 
proteins or RbAp48/GST fusion protein to enrich for HDx homologs which retain an 
ability to bind a substrate or regulatory protein. Each of these HDx homologs can 
subsequently be screened for further biological activities in order to differentiate agonists 
and antagonists. For example, histone-binding homologs isolated from the combinatorial 
library can be tested for their enzymatic activity directly, or for their effect on cellular 
proliferation relative to the wild-type form of the protein. 

The invention also provides for reduction of the HDx or RbAp48 or histones 
proteins to generate mimetics, e.g. peptide or non-peptide agents, which are able to 
disrupt a biological activity of an HDx polypeptide of the present invention, e.g. as a 
catalytic inhibitor or an inhibitor of protein-protein interactions. Thus, such mutagenic 
techniques as described above are also useful to map the determinants of the HDx 
proteins which participate in protein-protein or protein-DNA interactions involved in, for 
example, interaction of the subject HDx polypeptide with histones, RbAp48 or 
cytoskeletal elements. To illustrate, the critical residues of a subject HDx polypeptide 
which are involved in molecular recognition of histones can be determined and used to 
generate HDx-derived peptidomimetics which competitively inhibit binding of the 
authentic HDx protein with that moiety. Likewise, residues of a histone or of RbAp48 
involved in binding to HDx proteins can be identified, and peptides or peptidomimetics 
based on such residues can also be used as competitive inhibitors of the interaction of an 
HDx protein with either of those proteins. By employing, for example, scanning 
mutagenesis to map the amino acid residues of a protein which is involved in binding 
other proteins, peptidomimetic compounds can be generated which mimic those residues 
which facilitate the interaction. Such mimetics may then be used to interfere with the 
normal function of an HDx protein. For instance, non-hydrolyzable peptide analogs of 
such residues can be generated using benzodiazepine (e.g., see Freidinger et al. in 
Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, 
Netherlands, 1988), azepine (e.g., see Huffman et al. in Peptides: Chemistry and Biology, 
G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted gamma 
lactam rings (Garvey et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., 
ESCOM Publisher: Leiden, Netherlands, 1988), keto-methylene pseudopeptides 
(Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et al. in Peptides: Structure 
and Function (Proceedings of the 9th American Peptide Symposium) Pierce Chemical 
Co. Rockland, IL, 1985), β-turn dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 
26:647; and Sato et al. (1986) J Chem Soc Perkin Trans 1:1231), and β-aminoalcohols 
(Gordon et al. (1985) Biochem Biophys Res Commun 126:419; and Dann et al. (1986) 
Biochem Biophys Res Commun 134:71). 

Another aspect of the invention pertains to an antibody specifically reactive with 
an HDx protein. For example, by using immunogens derived from an HDx protein, e.g. 
based on the cDNA sequences, anti-protein/anti-peptide antisera or monoclonal 
antibodies can be made by standard protocols (See, for example, Antibodies: A 
Laboratory Manual ed. by Harlow and Lane (Cold Spring Harbor Press: 1988)). A 
mammal, such as a mouse, a hamster or rabbit can be immunized with an immunogenic 
form of the peptide (e.g., an HDx polypeptide or an antigenic fragment which is capable 
of eliciting an antibody response). Techniques for conferring immunogenicity on a 
protein or peptide include conjugation to carriers or other techniques well known in the 
art. An immunogenic portion of an HDx protein can be administered in the presence of 
adjuvant. The progress of immunization can be monitored by detection of antibody titers 
in plasma or serum. Standard ELISA or other immunoassays can be used with the 
immunogen as antigen to assess the levels of antibodies. In a preferred embodiment, the 
subject antibodies are immunospecific for antigenic determinants of an HDx protein of an 
organism, such as a mammal, e.g. antigenic determinants of a protein represented by one 
of SEQ ID Nos: 5-8 or closely related homologs (e.g. at least 85% homologous, 
preferably at least 90% homologous, and more preferably at least 95% homologous). In 
yet a further preferred embodiment of the present invention, in order to provide, for 
example, antibodies which are immuno-selective for discrete HDx homologs, e.g. HD1, 
the anti-HDx polypeptide antibodies do not substantially cross react (i.e. do not react 
specifically) with a protein which is, for example, less than 85%, 90% or 95% 
homologous with the selected HDx. By "not substantially cross react", it is meant that 
the antibody has a binding affinity for a non-homologous protein which is at least one 
order of magnitude, more preferably at least 2 orders of magnitude, and even more 
preferably at least 3 orders of magnitude less than the binding affinity of the antibody for 
the intended target HDx. 
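
The "orders of magnitude" criterion just stated can be expressed numerically. The Python sketch below is offered only as an illustration and assumes that binding affinity is reported as an association constant (a larger value meaning tighter binding); the function names and the example constants are hypothetical and are not measured values. 

```python
import math

def orders_of_magnitude_weaker(ka_target, ka_other):
    """log10 of the ratio of association constants (target over non-target)."""
    if ka_target <= 0 or ka_other <= 0:
        raise ValueError("association constants must be positive")
    return math.log10(ka_target / ka_other)

def not_substantially_cross_reactive(ka_target, ka_other, min_orders=1.0):
    """True if binding to the non-target is weaker by at least min_orders."""
    return orders_of_magnitude_weaker(ka_target, ka_other) >= min_orders

# Hypothetical association constants (per molar), for illustration only.
KA_TARGET_HDX = 1e9
KA_NON_HOMOLOGOUS = 1e7

meets_two_orders = not_substantially_cross_reactive(
    KA_TARGET_HDX, KA_NON_HOMOLOGOUS, min_orders=2.0)
meets_three_orders = not_substantially_cross_reactive(
    KA_TARGET_HDX, KA_NON_HOMOLOGOUS, min_orders=3.0)
```

With these illustrative constants the antibody would satisfy the two-orders criterion but not the three-orders criterion. 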

Following immunization of an animal with an antigenic preparation of an HDx 
polypeptide, anti-HDx antisera can be obtained and, if desired, polyclonal anti-HDx 
antibodies isolated from the serum. To produce monoclonal antibodies, antibody- 
producing cells (lymphocytes) can be harvested from an immunized animal and fused by 
standard somatic cell fusion procedures with immortalizing cells such as myeloma cells to 
yield hybridoma cells. Such techniques are well known in the art, and include, for 
example, the hybridoma technique (originally developed by Kohler and Milstein, (1975) 
Nature, 256: 495-497), the human B cell hybridoma technique (Kozbor et al. (1983) 
Immunology Today, 4: 72), and the EBV-hybridoma technique to produce human 
monoclonal antibodies (Cole et al., (1985) Monoclonal Antibodies and Cancer Therapy, 
Alan R. Liss, Inc. pp. 77-96). Hybridoma cells can be screened immunochemically for 
production of antibodies specifically reactive with an HDx polypeptide of the present 
invention and monoclonal antibodies isolated from a culture comprising such hybridoma 
cells. 



The term antibody as used herein is intended to include fragments thereof which 
are also specifically reactive with one of the subject HDx polypeptides. Antibodies can be 
fragmented using conventional techniques and the fragments screened for utility in the 
same manner as described above for whole antibodies. For example, F(ab)2 fragments 
can be generated by treating antibody with pepsin. The resulting F(ab)2 fragment can be 
treated to reduce disulfide bridges to produce Fab fragments. The antibody of the 
present invention is further intended to include bispecific and chimeric molecules having 
affinity for an HDx protein conferred by at least one CDR region of the antibody. 

Both monoclonal and polyclonal antibodies (Ab) directed against authentic HDx 
polypeptides, or HDx variants, and antibody fragments such as Fab, F(ab)2, Fv and scFv 
can be used to block the action of one or more HDx proteins and allow the study of the 
role of these proteins in, for example, differentiation of tissue. Experiments of this nature 
can aid in deciphering the role of HDx proteins that may be involved in control of 
proliferation versus differentiation, e.g., in patterning and tissue formation. 

Antibodies which specifically bind HDx epitopes can also be used in 
immunohistochemical staining of tissue samples in order to evaluate the abundance and 
pattern of expression of each of the subject HDx polypeptides. Anti-HDx antibodies can 
be used diagnostically in immuno-precipitation and immuno-blotting to detect and 
evaluate HDx protein levels in tissue as part of a clinical testing procedure. For instance, 
such measurements can be useful in predictive valuations of the onset or progression of 
proliferative or differentiative disorders. Likewise, the ability to monitor HDx protein 
levels in an individual can allow determination of the efficacy of a given treatment 
regimen for an individual afflicted with such a disorder. The level of HDx polypeptides 
may be measured from cells in bodily fluid, such as in samples of cerebral spinal fluid or 
amniotic fluid, or can be measured in tissue, such as produced by biopsy. Diagnostic 
assays using anti-HDx antibodies can include, for example, immunoassays designed to aid 
in early diagnosis of a disorder, particularly ones which are manifest at birth. Diagnostic 
assays using anti-HDx polypeptide antibodies can also include immunoassays designed to 
aid in early diagnosis and phenotyping neoplastic or hyperplastic disorders. 

Another application of anti-HDx antibodies of the present invention is in the 
immunological screening of cDNA libraries constructed in expression vectors such as 
λgt11, λgt18-23, λZAP, and λORF8. Messenger libraries of this type, having coding 
sequences inserted in the correct reading frame and orientation, can produce fusion 
proteins. For instance, λgt11 will produce fusion proteins whose amino termini consist 
of β-galactosidase amino acid sequences and whose carboxy termini consist of a foreign 
polypeptide. Antigenic epitopes of an HDx protein, e.g. other orthologs of a particular 
HDx protein or other paralogs from the same species, can then be detected with 
antibodies, as, for example, reacting nitrocellulose filters lifted from infected plates with 
anti-HDx antibodies. Positive phage detected by this assay can then be isolated from the 
infected plate. Thus, the presence of HDx homologs can be detected and cloned from 
other animals, as can alternate isoforms (including splicing variants) from humans. 

Moreover, the nucleotide sequences determined from the cloning of HDx genes 
from organisms will further allow for the generation of probes and primers designed for 
use in identifying and/or cloning HDx homologs in other cell types, e.g. from other 
tissues, as well as HDx homologs from other organisms. For instance, the present 
invention also provides a probe/primer comprising a substantially purified 
oligonucleotide, which oligonucleotide comprises a region of nucleotide sequence that 
hybridizes under stringent conditions to at least 10 consecutive nucleotides of sense or 
anti-sense sequence selected from the group consisting of SEQ ID Nos: 1-4 or naturally 
occurring mutants thereof. For instance, primers based on the nucleic acid represented in 
SEQ ID Nos: 1-4 can be used in PCR reactions to clone HDx homologs. Likewise, 
probes based on the subject HDx sequences can be used to detect transcripts or genomic 
sequences encoding the same or homologous proteins. In preferred embodiments, the 
probe further comprises a label group attached thereto and able to be detected, e.g. the 
label group is selected from amongst radioisotopes, fluorescent compounds, enzymes 
and enzyme co-factors. 

Such probes can also be used as a part of a diagnostic test kit for identifying cells 
or tissue which misexpress an HDx protein, such as by measuring a level of an HDx- 
encoding nucleic acid in a sample of cells from a patient; e.g. detecting HDx mRNA 
levels or determining whether a genomic HDx gene has been mutated or deleted. 

To illustrate, nucleotide probes can be generated from the subject HDx genes 
which facilitate histological screening of intact tissue and tissue samples for the presence 
(or absence) of HDx-encoding transcripts. Similar to the diagnostic uses of anti-HDx 
antibodies, the use of probes directed to HDx messages, or to genomic HDx sequences, 
can be used for both predictive and therapeutic evaluation of allelic mutations which 
might be manifest in, for example, neoplastic or hyperplastic disorders (e.g. unwanted cell 
growth) or abnormal differentiation of tissue. Used in conjunction with immunoassays as 
described above, the oligonucleotide probes can help facilitate the determination of the 
molecular basis for a developmental disorder which may involve some abnormality 
associated with expression (or lack thereof) of an HDx protein. For instance, variation in 
polypeptide synthesis can be differentiated from a mutation in a coding sequence. 

Accordingly, the present invention provides a method for determining if a subject is 
at risk for a disorder characterized by aberrant cell proliferation and/or differentiation. In 
preferred embodiments, the method can be generally characterized as comprising detecting, 
in a sample of cells from the subject, the presence or absence of a genetic lesion 
characterized by at least one of (i) an alteration affecting the integrity of a gene encoding 
an HDx-protein, or (ii) the mis-expression of the HDx gene. To illustrate, such genetic 
lesions can be detected by ascertaining the existence of at least one of (i) a deletion of 
one or more nucleotides from an HDx gene, (ii) an addition of one or more nucleotides to 
an HDx gene, (iii) a substitution of one or more nucleotides of an HDx gene, (iv) a gross 
chromosomal rearrangement of an HDx gene, (v) a gross alteration in the level of a 
messenger RNA transcript of an HDx gene, (vi) aberrant modification of an HDx gene, 
such as of the methylation pattern of the genomic DNA, (vii) the presence of a non-wild 
type splicing pattern of a messenger RNA transcript of an HDx gene, (viii) a non-wild 
type level of an HDx-protein, and (ix) inappropriate post-translational modification of an 
HDx-protein. As set out below, the present invention provides a large number of assay 
techniques for detecting lesions in an HDx gene, and importantly, provides the ability to 
discern between different molecular causes underlying HDx-dependent aberrant cell 
growth, proliferation and/or differentiation. 

In an exemplary embodiment, there is provided a nucleic acid composition 
comprising a (purified) oligonucleotide probe including a region of nucleotide sequence 
which is capable of hybridizing to a sense or antisense sequence of an HDx gene, such as 
represented by any of SEQ ID Nos: 1-4, or naturally occurring mutants thereof, or 5' or 
3' flanking sequences or intronic sequences naturally associated with the subject HDx 
genes or naturally occurring mutants thereof. The nucleic acid of a cell is rendered 
accessible for hybridization, the probe is exposed to nucleic acid of the sample, and the 
hybridization of the probe to the sample nucleic acid is detected. Such techniques can be 
used to detect lesions at either the genomic or mRNA level, including deletions, 
substitutions, etc., as well as to determine mRNA transcript levels. 

In certain embodiments, detection of the lesion comprises utilizing the 
probe/primer in a polymerase chain reaction (PCR) (see, e.g. U.S. Patent Nos. 4,683,195 
and 4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain 
reaction (LCR) (see, e.g., Landegren et al. (1988) Science 241:1077-1080; and 
Nakazawa et al. (1994) PNAS 91:360-364), the latter of which can be particularly useful 
for detecting point mutations in the HDx gene. In a merely illustrative embodiment, the 
method includes the steps of (i) collecting a sample of cells from a patient, (ii) isolating 
nucleic acid (e.g., genomic, mRNA or both) from the cells of the sample, (iii) contacting 
the nucleic acid sample with one or more primers which specifically hybridize to an HDx 
gene under conditions such that hybridization and amplification of the HDx gene (if 
present) occurs, and (iv) detecting the presence or absence of an amplification product, 
or detecting the size of the amplification product and comparing the length to a control 
sample. 
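
Step (iv) above, in which the amplification product is scored for presence and for size relative to a control, can be sketched as follows in Python. This is a minimal illustration only: the function name, the returned labels and the size tolerance `tolerance_bp` are assumptions made for the example, and absence of a product could equally reflect a failed reaction rather than a lesion. 

```python
def classify_amplicon(sample_bp, control_bp, tolerance_bp=5):
    """Compare a sample amplification product with the control product.

    sample_bp is the measured product length in base pairs, or None when
    no amplification product is detected.
    """
    if sample_bp is None:
        return "no product: target may be absent or deleted"
    difference = sample_bp - control_bp
    if abs(difference) <= tolerance_bp:
        return "size matches control"
    if difference < 0:
        return "shorter than control: possible deletion"
    return "longer than control: possible insertion"

# Hypothetical product lengths, for illustration only.
result_match = classify_amplicon(502, 500)
result_short = classify_amplicon(430, 500)
result_none = classify_amplicon(None, 500)
```

A point mutation would not ordinarily change product length, which is consistent with the text's suggestion that LCR is the more useful format for that class of lesion. 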

In still another embodiment, the level of an HDx-protein can be detected by 
immunoassay. For instance, the cells of a biopsy sample can be lysed, and the level of an 
HDx-protein present in the cell can be quantitated by standard immunoassay techniques. 
In yet another exemplary embodiment, aberrant methylation patterns of an HDx gene can 
be detected by digesting genomic DNA from a patient sample with one or more 
restriction endonucleases that are sensitive to methylation and for which recognition sites 
exist in the HDx gene (including in the flanking and intronic sequences). See, for 
example, Buiting et al. (1994) Human Mol Genet 3:893-895. Digested DNA is separated 
by gel electrophoresis, and hybridized with probes derived from, for example, genomic or 
cDNA sequences. The methylation status of the HDx gene can be determined by 
comparison of the restriction pattern generated from the sample DNA with that for a 
standard of known methylation. 

In yet another aspect of the invention, the subject HDx polypeptides can be used 
to generate a "two hybrid" assay or an "interaction trap" assay (see, for example, U.S. 
Patent No. 5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J Biol 
Chem 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; Iwabuchi et al. 
(1993) Oncogene 8:1693-1696; and Brent WO94/10300), for isolating coding sequences 
for other cellular proteins which bind HDxs ("HDx-binding proteins" or "HDx-bp"). 
Such HDx-binding proteins would likely be involved in the regulation of HDx, e.g. as 
regulatory subunits or transducers, or be substrates which are regulated by an HDx. 

Briefly, the interaction trap relies on reconstituting in vivo a functional 
transcriptional activator protein from two separate fusion proteins. In particular, the 
method makes use of chimeric genes which express hybrid proteins. To illustrate, a first 
hybrid gene comprises the coding sequence for a DNA-binding domain of a 
transcriptional activator fused in frame to the coding sequence for an HDx polypeptide. 
The second hybrid gene encodes a transcriptional activation domain fused in frame to a 
sample gene from a cDNA library. If the bait and sample hybrid proteins are able to 
interact, e.g., form an HDx-dependent complex, they bring into close proximity the two 
domains of the transcriptional activator. This proximity is sufficient to cause 
transcription of a reporter gene which is operably linked to a transcriptional regulatory 
site responsive to the transcriptional activator, and expression of the reporter gene can be 
detected and used to score for the interaction of the HDx and sample proteins. 

Furthermore, by making available purified and recombinant HDx polypeptides, 
the present invention facilitates the development of assays which can be used to screen 
for drugs, including HDx homologs, which are either agonists or antagonists of the 
normal cellular function of the subject HDx polypeptides, or of their role in the 
pathogenesis of cellular differentiation and/or proliferation and disorders related thereto. 
Moreover, because we have also identified HDx-related proteins, such as the yeast RPD3 
proteins, as histone deacetylases, the present invention further provides drug screening 
assays for detecting agents which modulate the bioactivity of HDx-related proteins. Such 
agents, when directed to, for example, fungal HDx-related proteins, can be used in the 
treatment of various infections. In a general sense, the assay evaluates the ability of a 
compound to modulate binding between an HDx polypeptide and a molecule, be it 
protein or DNA, that interacts with the HDx polypeptide. It will be apparent from the 
following description of exemplary assays that, in place of a human (or other mammalian) 
HDx protein, the assay can be derived with an HDx-related protein such as RPD3. 
Likewise, in place of human RbAp48 or Sin3A, other HDx-binding proteins can be used, 
e.g., other human proteins. Exemplary compounds which can be screened include 
peptides, nucleic acids, carbohydrates, small organic molecules, and natural product 
extract libraries, such as isolated from animals, plants, fungus and/or microbes. 

It is contemplated that any of the novel interactions described herein could be 
exploited in a drug screening assay. For example, in one embodiment, the interaction 
between an HDx protein and RbAp48 can be detected in the presence and the absence of 
a test compound. In another embodiment, the ability of a compound to modulate the 
binding of an HDx protein, or HDx-related protein such as the yeast RPD3, with histones 
can be assessed. The identification of a test compound which influences, for example, 
HD1 catalyzed deacetylation of histones would be useful in the modulation of HD1 
activity in mammalian cells, while the identification of a test compound which selectively 
inhibits the yeast RPD3 deacetylase activity would be useful as an antifungal agent. In 
other embodiments the effect of a test compound on the binding of an HDx protein to 
other molecules, such as cytoskeletal components, or other proteins identified by the 
HDx-dependent ITS set out above, could be tested. A variety of assay formats will 
suffice and, in light of the present inventions, will be comprehended by a skilled artisan. 

In a preferred embodiment, assays which employ the subject mammalian HDx 
proteins can be used to identify compounds that have therapeutic indexes more favorable 
than sodium butyrate, trapoxin, trichostatin or the like. For instance, trapoxin-like drugs 
can be identified by the present invention which have enhanced tissue-type or cell-type 
specificity relative to trapoxin. To illustrate, the subject assays can be used to generate 
compounds which preferentially inhibit IL-2 mediated proliferation/activation of 
lymphocytes, or inhibit proliferation of certain tumor cells, without substantially 
interfering with other tissues, e.g. hepatocytes. Likewise, similar assays can be used to 
identify drugs which inhibit proliferation of yeast cells or other lower eukaryotes, but 
which have a substantially reduced effect on mammalian cells, thereby improving 
therapeutic index of the drug as an anti-mycotic agent. 

In one embodiment, the identification of such compounds is made possible by the 
use of differential screening assays which detect and compare drug-mediated inhibition of 
deacetylase activity between two or more different HDx-like enzymes, or compare 
drug-mediated inhibition of formation of complexes involving two or more different types of 
HDx-like proteins. To illustrate, the assay can be designed for side-by-side comparison 
of the effect of a test compound on the deacetylase activity or protein interactions of 
tissue-type specific HDx proteins. Given the apparent diversity of HDx proteins, it is 
probable that different functional HDx activities, or HDx complexes exist and, in certain 
instances, are localized to particular tissue or cell types. Thus, test compounds can be 
screened for agents able to inhibit the tissue-specific formation of only a subset of the 
possible repertoire of HDx/regulatory protein complexes, or which preferentially inhibit 
certain HDx enzymes. In an exemplary embodiment, an interaction trap assay can be 
derived using two or more different human HDx "bait" proteins, while the "fish" protein 
is constant in each, e.g. a human RbAp48 construct. Running the interaction trap 
side-by-side permits the detection of agents which have a greater effect (e.g. statistically 
significant) on the formation of one of the HDx/RbAp48 complexes than on the formation 
of the other HDx complexes. 

In similar fashion, differential screening assays can be used to exploit the 
difference in protein interactions and/or catalytic mechanism of mammalian HDx proteins 
and yeast RPD3 proteins in order to identify agents which display a statistically significant 
increase in specificity for inhibiting the yeast enzyme relative to the mammalian enzyme. 
Thus, lead compounds which act specifically on pathogens, such as fungus involved in 
mycotic infections, can be developed. By way of illustration, the present assays can be 
used to screen for agents which may ultimately be useful for inhibiting at least one fungus 
implicated in such mycosis as candidiasis, aspergillosis, mucormycosis, blastomycosis, 
geotrichosis, cryptococcosis, chromoblastomycosis, coccidioidomycosis, conidiosporosis, 
histoplasmosis, maduromycosis, rhinosporidiosis, nocardiosis, para-actinomycosis, 
penicilliosis, moniliasis, or sporotrichosis. For example, if the mycotic infection to which 
treatment is desired is candidiasis, the present assay can comprise comparing the relative 
effectiveness of a test compound on inhibiting the deacetylase activity of a mammalian 
HDx protein with its effectiveness towards inhibiting the deacetylase activity of an RPD3 
homolog cloned from yeast selected from the group consisting of Candida albicans, 
Candida stellatoidea, Candida tropicalis, Candida parapsilosis, Candida krusei, 
Candida pseudotropicalis, Candida guilliermondii, or Candida rugosa. Likewise, the 
present assay can be used to identify anti-fungal agents which may have therapeutic value 
in the treatment of aspergillosis by selectively targeting RPD3 homologs cloned from 
yeast such as Aspergillus fumigatus, Aspergillus flavus, Aspergillus niger, 
Aspergillus nidulans, or Aspergillus terreus. Where the mycotic infection is 
mucormycosis, the RPD3 deacetylase can be derived from yeast such as Rhizopus 
arrhizus, Rhizopus oryzae, Absidia corymbifera, Absidia ramosa, or Mucor pusillus. 
Sources of other RPD3 activities for comparison with a mammalian HDx activity include 
the pathogen Pneumocystis carinii. 

In addition to such HDx therapeutic uses, anti-fungal agents developed with such
differential screening assays can be used, for example, as preservatives in foodstuff, feed 
supplement for promoting weight gain in livestock, or in disinfectant formulations for 
treatment of non-living matter, e.g., for decontaminating hospital equipment and rooms. 

In similar fashion, side-by-side comparison of inhibition of a mammalian HDx
protein and an insect HDx-related protein will permit selection of HDx inhibitors which
discriminate between the human/mammalian and insect enzymes. Accordingly, the
present invention expressly contemplates the use and formulations of the subject HDx
therapeutics in insecticides, such as for use in management of insects like the fruit fly.

In yet another embodiment, certain of the subject HDx inhibitors can be selected 
on the basis of inhibitory specificity for plant HDx-related activities relative to the
mammalian enzyme. For example, a plant HDx-related protein can be disposed in a
differential screen with one or more of the human enzymes to select those compounds of 
greatest selectivity for inhibiting the plant enzyme. Thus, the present invention 
specifically contemplates formulations of the subject HDx inhibitors for agricultural 
applications, such as in the form of a defoliant or the like. 

In many drug screening programs which test libraries of compounds and natural 
extracts, high throughput assays are desirable in order to maximize the number of 
compounds surveyed in a given period of time. Assays which are performed in cell-free 
systems, such as may be derived with purified or semi-purified proteins, are often 
preferred as "primary" screens in that they can be generated to permit rapid development 
and relatively easy detection of an alteration in a molecular target which is mediated by a 
test compound. Moreover, the effects of cellular toxicity and/or bioavailability of the test 
compound can be generally ignored in the in vitro system, the assay instead being 
focused primarily on the effect of the drug on the molecular target as may be manifest in 
an alteration of binding affinity with upstream or downstream elements. Accordingly, in
an exemplary screening assay of the present invention, a reaction mixture is generated to 
include an HDx polypeptide, compound(s) of interest, and a "target polypeptide", e.g., a 
protein, which interacts with the HDx polypeptide, whether as a substrate or by some 
other protein-protein interaction. Exemplary target polypeptides include histones, 
RbAp48 polypeptides, Sin3 polypeptides, and/or combinations thereof or with other
transcriptional regulatory proteins (such as myc, max, etc.; see Example 3). Detection
and quantification of complexes containing the HDx protein provide a means for 
determining a compound's efficacy at inhibiting (or potentiating) complex formation 
between the HDx and the target polypeptide. The efficacy of the compound can be 
assessed by generating dose response curves from data obtained using various 
concentrations of the test compound. Moreover, a control assay can also be performed 
to provide a baseline for comparison. In the control assay, isolated and purified HDx 
polypeptide is added to a composition containing the target polypeptide and the 
formation of a complex is quantitated in the absence of the test compound. 
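
The dose-response analysis described above can be sketched as follows. This is a minimal illustration only, assuming that complex formation is read out as bound label (cpm) and that the IC50 is estimated by simple log-linear interpolation; the function names, the interpolation method, and all numerical readings are hypothetical and are not taken from the specification.

```python
import math


def percent_inhibition(signal, control_signal, background=0.0):
    """Percent loss of complex signal relative to the no-compound control."""
    span = control_signal - background
    if span <= 0:
        raise ValueError("control signal must exceed background")
    return 100.0 * (1.0 - (signal - background) / span)


def ic50_by_interpolation(concentrations, signals, control_signal, background=0.0):
    """Estimate the IC50 by log-linear interpolation between the two
    concentrations that bracket 50% inhibition; None if never bracketed."""
    curve = sorted(
        (c, percent_inhibition(s, control_signal, background))
        for c, s in zip(concentrations, signals)
    )
    for (c_lo, i_lo), (c_hi, i_hi) in zip(curve, curve[1:]):
        if i_lo < 50.0 <= i_hi:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_ic50
    return None


# Hypothetical readings: bound label (cpm) at each test-compound dose (nM).
doses_nM = [1, 10, 100, 1000]
bound_cpm = [950, 800, 200, 60]
control_cpm = 1000  # complex formed in the absence of test compound (control assay)

ic50_nM = ic50_by_interpolation(doses_nM, bound_cpm, control_cpm)
print(f"Estimated IC50: {ic50_nM:.1f} nM")
```

In practice a full curve fit (e.g. a four-parameter logistic model) would usually be preferred over interpolation; the sketch is meant only to show how the control assay provides the baseline against which each dose is scored.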

Complex formation between the HDx polypeptide and the target polypeptide may 
be detected by a variety of techniques. Modulation of the formation of complexes can be
quantitated using, for example, detectably labeled proteins such as radiolabeled,
fluorescently labeled, or enzymatically labeled HDx polypeptides, by immunoassay, by
chromatographic detection, or by detecting the intrinsic activity of the acetylase.

Typically, it will be desirable to immobilize either HDx or the target polypeptide
to facilitate separation of complexes from uncomplexed forms of one or both of the
proteins, as well as to accommodate automation of the assay. Binding of HDx to the
target polypeptide, in the presence and absence of a candidate agent, can be
accomplished in any vessel suitable for containing the reactants. Examples include
microtitre plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion
protein can be provided which adds a domain that allows the protein to be bound to a
matrix. For example, glutathione-S-transferase/HDx (GST/HDx) fusion proteins can be
adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or
glutathione derivatized microtitre plates, which are then combined with the cell lysates,
e.g. 35S-labeled lysates, and the test compound, and the mixture incubated under conditions
conducive to complex formation, e.g. at physiological conditions for salt and pH, though
slightly more stringent conditions may be desired. Following incubation, the beads are
washed to remove any unbound label, and the matrix immobilized and radiolabel
determined directly (e.g. beads placed in scintillant), or in the supernatant after the
complexes are subsequently dissociated. Alternatively, the complexes can be dissociated
from the matrix, separated by SDS-PAGE, and the level of HDx-binding protein found in
the bead fraction quantitated from the gel using standard electrophoretic techniques such
as described in the appended examples.

Other techniques for immobilizing proteins on matrices are also available for use
in the subject assay. For instance, either HDx or target polypeptide can be immobilized
utilizing conjugation of biotin and streptavidin. For instance, biotinylated HDx molecules
can be prepared from biotin-NHS (N-hydroxy-succinimide) using techniques well known
in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), and immobilized in the
wells of streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, antibodies
reactive with HDx, but which do not interfere with the interaction between the HDx and
target polypeptide, can be derivatized to the wells of the plate, and HDx trapped in the
wells by antibody conjugation. As above, preparations of a target polypeptide and a test
compound are incubated in the HDx-presenting wells of the plate, and the amount of
complex trapped in the well can be quantitated. Exemplary methods for detecting such
complexes, in addition to those described above for the GST-immobilized complexes,
include immunodetection of complexes using antibodies reactive with the target
polypeptide, or which are reactive with HDx protein and compete with the target
polypeptide; as well as enzyme-linked assays which rely on detecting an enzymatic
activity associated with the target polypeptide, either intrinsic or extrinsic activity. In the
instance of the latter, the enzyme can be chemically conjugated or provided as a fusion
protein with the target polypeptide. To illustrate, the target polypeptide can be
chemically cross-linked or genetically fused with horseradish peroxidase, and the amount
of polypeptide trapped in the complex can be assessed with a chromogenic substrate of
the enzyme, e.g. 3,3'-diaminobenzidine tetrahydrochloride or 4-chloro-1-naphthol.
Likewise, a fusion protein comprising the polypeptide and glutathione-S-transferase can
be provided, and complex formation quantitated by detecting the GST activity using
1-chloro-2,4-dinitrobenzene (Habig et al. (1974) J Biol Chem 249:7130).

For processes which rely on immunodetection for quantitating one of the proteins
trapped in the complex, antibodies against the protein, such as anti-HDx antibodies, can
be used. Alternatively, the protein to be detected in the complex can be "epitope tagged"
in the form of a fusion protein which includes, in addition to the HDx sequence, a second
polypeptide for which antibodies are readily available (e.g. from commercial sources).
For instance, the GST fusion proteins described above can also be used for quantification
of binding using antibodies against the GST moiety. Other useful epitope tags include
myc-epitopes (e.g., see Ellison et al. (1991) J Biol Chem 266:21150-21157) which
includes a 10-residue sequence from c-myc, as well as the pFLAG system (International
Biotechnologies, Inc.) or the pEZZ-protein A system (Pharmacia, NJ).

In another embodiment of a drug screening assay, a two hybrid assay can be generated
with an HDx and an HDx-binding protein. Drug dependent inhibition or potentiation of the
interaction can be scored.

Where the HDx proteins themselves, or in complexes with other proteins, are
capable of binding DNA and modifying transcription of a gene, a transcriptional based
assay can be used, for example one using a transcriptional regulatory sequence responsive to HDx
complexes operably linked to a detectable marker gene. For illustration, see Example 3.

To test the effect of a histone deacetylase inhibitor on MadN35GALVP16 and
Mad(Pro)N35GALVP16 mediated repression, we treated a duplicate set of transfections
with 10 nM trapoxin for eight hours prior to harvest. In the representative experiment
shown, 10 nM trapoxin treatment derepressed the activity of MadN35GALVP16 nine-fold,
while it had little effect on the activity of Mad(Pro)N35GALVP16, suggesting that
histone deacetylation plays a direct role in mSin3A transcriptional repression (Figure
13B). In addition, there was typically less than a two-fold effect of trapoxin on the
activity of the reporter gene in cells transfected with the expression vector alone or in
cells transfected with GALVP16 (data not shown). Following trapoxin treatment, the
[...] MadN35GALVP16 was still seven times greater than that of
Mad(Pro)N35GALVP16, suggesting that the residual deacetylase activity following
trapoxin treatment (Figure 13B) continues to drive mSin3A-mediated repression;
however, we cannot rule out that mSin3A is capable of repression by mechanisms
independent of histone deacetylation.



Furthermore, each of the assay systems set out above can be generated in a
"differential" format as set forth above. That is, the assay format can provide information
regarding specificity as well as potency. For instance, side-by-side comparison of a test
compound's effect on different HDxs can provide information on selectivity and permit
the identification of compounds which selectively modulate the bioactivity of only a
subset of the HDx family.
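
As a rough illustration of how a side-by-side comparison can be turned into a selectivity measure, the sketch below expresses selectivity as the fold difference between IC50 values measured against different enzymes. The enzyme labels, the IC50 values, and the 10-fold cutoff are hypothetical assumptions chosen for the example; the specification itself speaks only of a statistically significant difference and does not prescribe a particular metric.

```python
def selectivity_profile(ic50_by_enzyme):
    """Return the most sensitive enzyme and the fold-selectivity over each other enzyme."""
    most_sensitive = min(ic50_by_enzyme, key=ic50_by_enzyme.get)
    reference = ic50_by_enzyme[most_sensitive]
    folds = {
        name: ic50 / reference
        for name, ic50 in ic50_by_enzyme.items()
        if name != most_sensitive
    }
    return most_sensitive, folds


def is_selective(ic50_by_enzyme, min_fold=10.0):
    """True if the compound is at least min_fold more potent on one enzyme than on all others."""
    _, folds = selectivity_profile(ic50_by_enzyme)
    return all(fold >= min_fold for fold in folds.values())


# Hypothetical IC50 values (nM) for one test compound against three enzymes.
ic50_nM_by_enzyme = {"HDx-A": 20.0, "HDx-B": 400.0, "yeast RPD3": 5000.0}

target, folds = selectivity_profile(ic50_nM_by_enzyme)
print(target, folds, is_selective(ic50_nM_by_enzyme))
```

The same comparison applies whether the enzymes are different mammalian HDx family members or a mammalian enzyme and a yeast, insect, or plant homolog.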

Furthermore, inhibitors of the enzymatic activity of each of the subject HDx
proteins can be identified using assays derived from measuring the ability of an agent to
inhibit catalytic conversion of a substrate by the subject proteins. For example, the ability
of the subject HDx proteins to deacetylate a histone substrate, such as histone H4 (see
examples), in the presence and absence of a candidate inhibitor, can be determined using
standard enzymatic assays.

A number of methods have been employed in the art for assaying histone
deacetylase activity, and can be incorporated in the drug screening assays of the present
invention. In preferred embodiments, the assay will employ a labeled acetyl group linked
to appropriate histone lysine residues as substrates. In other embodiments, a histone
substrate peptide can be labeled with a group whose signal is dependent on the
simultaneous presence or absence of an acetyl group, e.g., the label can be a fluorogenic
group whose fluorescence is modulated (either quenched or potentiated) by the presence
of the acetyl moiety. Using standard enzymatic analysis, the ability of a test agent to
cause a statistically significant change in substrate conversion by a histone deacetylase
can be measured, and as desirable, inhibition constants, e.g., Ki values, can be calculated.
The histone substrate can be provided as a purified or semi-purified polypeptide or as
part of a cell lysate. Likewise, the histone deacetylase can be provided to the reaction
mixture as a purified or semi-purified polypeptide or as a cell lysate. Accordingly, the
reaction mixtures of the subject method can range from reconstituted protein mixtures
derived with purified preparations of histones and deacetylases, to mixtures of cell
lysates, e.g., by admixing baculovirus lysates containing recombinant histones and
deacetylases.
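
One common way to obtain an inhibition constant from such measurements is the Cheng-Prusoff relation, sketched below. The sketch assumes a reversible, purely competitive inhibitor obeying Michaelis-Menten kinetics, which is an assumption of this example and is not stated in the specification (covalently reacting inhibitors of the kind described elsewhere herein would need a different treatment); the numerical values are hypothetical.

```python
def ki_cheng_prusoff(ic50, substrate_conc, km):
    """Ki for a competitive inhibitor: Ki = IC50 / (1 + [S]/Km).
    substrate_conc and km must share units; Ki is returned in the units of ic50."""
    if km <= 0 or substrate_conc < 0:
        raise ValueError("km must be positive and substrate_conc non-negative")
    return ic50 / (1.0 + substrate_conc / km)


def competitive_rate(vmax, substrate_conc, km, inhibitor_conc, ki):
    """Michaelis-Menten rate in the presence of a competitive inhibitor."""
    km_apparent = km * (1.0 + inhibitor_conc / ki)
    return vmax * substrate_conc / (km_apparent + substrate_conc)


# Hypothetical values: IC50 of 30 nM measured at 20 uM acetylated substrate, Km of 10 uM.
ki_nM = ki_cheng_prusoff(ic50=30.0, substrate_conc=20.0, km=10.0)

# Consistency check: at [I] = IC50 the rate should be half the uninhibited rate.
v0 = competitive_rate(1.0, 20.0, 10.0, inhibitor_conc=0.0, ki=ki_nM)
v_half = competitive_rate(1.0, 20.0, 10.0, inhibitor_conc=30.0, ki=ki_nM)
print(ki_nM, v_half / v0)
```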

In an exemplary embodiment, the histone substrate for the subject assay is
provided by isolation of radiolabeled histones from metabolically labelled cells. To
illustrate, as described by Hay et al. (1983) J Biol Chem 258:3726-3734, HeLa cells can
be labelled in culture by addition of [3H]acetate (New England Nuclear) to the culture
media. The addition of butyrate, trapoxin or the like can be used to increase the
abundance of acetylated histones in the cells. Radiolabelled histones can be isolated from
the cells by extraction with H2SO4 (Marushige et al. (1966) J Mol Biol 15:160-174).
Briefly, cells are homogenized in buffer, centrifuged to isolate a nuclear pellet, the
subsequently homogenized nuclear pellet centrifuged through sucrose, and the resulting
chromatin pellet extracted by addition of H2SO4 to yield [3H]acetyl-labelled histones. In
an alternate embodiment, nucleosome preparations containing [3H]acetyl-labelled
histones can be isolated from the labelled cells. As described in the art, nucleosomes can
be isolated from cell preparations by sucrose gradient centrifugation (Hay et al. (1983) J
Biol Chem 258:3726-3734; and Noll (1967) Nature 215:360-363), and polynucleosomes
can be prepared by NaCl precipitation from micrococcal nuclease digested cells (Hay et
al., supra). Similar procedures for isolating labelled histones from other cell types,
including yeast, have been described. See, for example, Alonso et al. (1986) Biochem
Biophys Acta 866:161-169; and Kreiger et al. (1974) J Biol Chem 249:332-334. In yet
other embodiments, the histone is generated by recombinant gene expression, and
includes an exogenous tag (e.g., an HA epitope, a poly(his) sequence or the like) which
facilitates purification from cell extracts. In still other embodiments, whole nuclei can
be isolated from metabolically labelled cells by micrococcal nuclease digestion (Hay et al.,
supra).

In still another embodiment, the deacetylase substrate can be provided as an
acetylated peptide including a sequence corresponding to the sequence about the specific
lysyl residues acetylated on histone, e.g., peptidyl portions of the core histones H2A,
H2B, H3 or H4. Such fragments can be produced by cleavage of acetylated histones
derived from metabolically labelled cells, e.g., such as by treatment with proteolytic
enzymes or cyanogen bromide (Kreiger et al., supra). In other embodiments, the
acetylated peptide can be provided by standard solid phase synthesis using acetylated
lysine residues (Kreiger et al., supra).

Continuing with the illustrative use of [3H]acetyl-labelled histones, the activity of
a histone deacetylase in the subject assays is detected by measuring release of [3H]acetate.
A reaction mixture is provided which comprises a recombinant HDx protein suspended in buffer, along with a
sample of [3H]acetyl-labelled histones and (optionally) a test compound. The reaction
mixture is maintained at a desired temperature and pH, such as 22°C at pH 7.8, for
[...] of denaturation. The
mixture can be acidified with concentrated HCl, and used to create a biphasic mixture
with ethyl acetate, the ethyl acetate phase collected and counted by standard scintillation methods. Other
methods for detecting acetate release will be easily recognized by those skilled in the art.
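
The counting step of the acetate-release assay lends itself to a short worked example. The sketch below assumes replicate scintillation counts and a no-enzyme blank that is subtracted from every reading; the blank, the replicate layout, and all counts are hypothetical and serve only to illustrate the arithmetic.

```python
from statistics import mean


def net_release(sample_cpm, blank_cpm):
    """Mean counts in the ethyl acetate phase minus the mean no-enzyme blank."""
    return mean(sample_cpm) - mean(blank_cpm)


def remaining_activity(test_cpm, uninhibited_cpm, blank_cpm):
    """Fraction of deacetylase activity remaining in the presence of a test compound."""
    uninhibited = net_release(uninhibited_cpm, blank_cpm)
    if uninhibited <= 0:
        raise ValueError("uninhibited reaction shows no release above blank")
    return net_release(test_cpm, blank_cpm) / uninhibited


# Hypothetical duplicate counts (cpm) from the ethyl acetate phase.
blank = [100, 120]          # labelled histones, no enzyme
uninhibited = [2100, 2120]  # enzyme, no test compound
with_compound = [600, 620]  # enzyme plus test compound

fraction = remaining_activity(with_compound, uninhibited, blank)
print(f"Remaining activity: {fraction:.0%}")
```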

In yet another embodiment, the drug screening assay is derived to include a whole
cell expressing one or more of a target protein or HDx protein. The ability
of a test agent to alter the activity of the HDx protein can be detected by analysis of the
[...] activity [...] (phenotype) of the
cell. General techniques for detecting each are well known, and will vary with respect to
the source of the particular reagent cell utilized in any given assay.

For example, quantification of proliferation of cells in the presence and absence of
a candidate agent can be measured with a number of techniques well known in the art,
including simple measurement of population growth curves. For instance, where the
assay involves proliferation in a liquid medium, turbidimetric techniques (i.e. absorbance/
[...]) can be utilized. For
[...] measurement of absorbance
of light at a wavelength between 540 and 600 nm can provide a conveniently fast measure
of cell growth. Likewise, ability to form colonies in solid medium (e.g. agar) can be used
[...] embodiments, an HDx substrate protein, such
[...] permits the substrate to be
isolated from cell lysates and the degree of acetylation detected. Each of these
techniques is suitable for the high through-put analysis necessary for rapid screening of
large numbers of candidate agents.
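
By way of illustration, turbidity readings can be converted into a growth rate and compared between treated and untreated cultures as sketched below. The sketch assumes exponential growth between the two time points and absorbance readings within the linear range of the instrument; the function names and all readings are hypothetical.

```python
import math


def growth_rate(t1_h, od1, t2_h, od2):
    """Specific growth rate (per hour) from two absorbance readings,
    assuming exponential growth between the two time points."""
    if t2_h <= t1_h or od1 <= 0 or od2 <= 0:
        raise ValueError("need t2 > t1 and positive absorbance readings")
    return math.log(od2 / od1) / (t2_h - t1_h)


def doubling_time(rate_per_h):
    """Population doubling time in hours for a positive growth rate."""
    if rate_per_h <= 0:
        raise ValueError("growth rate must be positive")
    return math.log(2) / rate_per_h


def growth_inhibition(rate_treated, rate_control):
    """Percent reduction in growth rate caused by the candidate agent."""
    return 100.0 * (1.0 - rate_treated / rate_control)


# Hypothetical absorbance readings (e.g. at 600 nm) taken 4.5 hours apart.
control_rate = growth_rate(0.0, 0.10, 4.5, 0.80)
treated_rate = growth_rate(0.0, 0.10, 4.5, 0.20)

print(doubling_time(control_rate), growth_inhibition(treated_rate, control_rate))
```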

In addition, where the ability of an agent to cause or reverse a transformed
phenotype is to be assessed, growth in solid media such as agar can further aid in establishing whether a
mammalian cell is transformed.
Additionally, visual inspection of the morphology of the reagent cell can be used
to determine whether the biological activity of the targeted HDx protein has been affected
by the added agent. To illustrate, the ability of an agent to influence an apoptotic
phenotype which is mediated in some way by a recombinant HDx protein can be assessed
by visual microscopy. Likewise, the formation of certain cellular structures as part of
differentiation, such as the formation of neuritic processes, can be visualized under a light
microscope. 

The nature of the effect of test agent on reagent cell can be assessed by measuring 
levels of expression of specific genes, e.g., by reverse transcription-PCR. Another 
method of scoring for effect on activity is by detecting cell-type specific marker 
expression through immunofluorescent staining. Many such markers are known in the 
art, and antibodies are readily available. For example, the presence of chondroitin 
sulphate proteoglycans as well as type-II collagen are correlated with cartilage 
production in chondrocytes, and each can be detected by immunostaining. Similarly, the 
human kidney differentiation antigen gp160, human aminopeptidase A, is a marker of
kidney induction, and the cytoskeletal protein troponin I is a marker of heart induction. 
In yet another embodiment, the alteration of expression of a reporter gene construct 
provided in the reagent cell provides a means of detecting the effect on HDx activity. For 
example, reporter gene constructs derived using the transcriptional regulatory sequences, 
e.g. the promoters, for developmentally regulated genes can be used to drive the
expression of a detectable marker, such as a luciferase gene. In an illustrative
embodiment, the construct is derived using the promoter sequence from a gene expressed
in a particular differentiative phenotype. 

It is also deemed to be within the scope of this invention that the recombinant
HDx cells of the present assay can be generated so as to comprise heterologous HDx
proteins (i.e. cross-species expression). For example, HDx proteins from one species can
be expressed in the cells of another under conditions wherein the heterologous protein is
able to rescue loss-of-function mutations in the host cell. For example, the reagent cell
can be a yeast cell in which a human HDx protein (e.g. exogenously expressed) is the
intended target for development of an anti-proliferative agent. To illustrate, the M778
strain, MATa ura3-52 trp1Δ1 his3-200 leu2-1 trk1Δ rpd3Δ::HIS3, described by Vidal et
al. (1991) Mol Cell Biol 6317-6327, which lacks a functional endogenous RPD3 gene,
can be transfected with an expression plasmid including a mammalian HDx gene in order
to complement the RPD3 loss-of-function. For example, the coding sequence for HD1
can be cloned into a pRS integrative plasmid containing a selectable marker (Sikorski et
al. (1989) Genetics 122:19-27), and the resulting construct used to transform the M778
strain. The resulting cells should produce a mammalian HD1 protein which may be
capable of performing at least some of the functions of the yeast RPD3 protein. The HDx
transformed yeast cells can be easier to manipulate than mammalian cells, and can provide
access to certain assay formats, such as turbidity detection methods, which may not be
obtainable with mammalian cells.

Moreover, the combination of the "mammalianized" strain with the strain M537
(MATa ura3-52 trp1Δ1 his3-200 leu2-1 trk1Δ, Vidal et al., supra) can provide an
exquisitely sensitive cell-based assay for detecting agents which specifically inhibit, for
example, the yeast RPD3 deacetylase.

In another aspect, the invention provides compounds useful for inhibition of 
HDxs. In a preferred embodiment, an HDx inhibitor compound of the invention can be 
represented by the formula A-B-C, in which A is a specificity element for selective
binding to an HDx, B is a linker element, and C is an electrophilic moiety capable of
reacting with a nucleophilic moiety of an HDx; with the proviso that the compound is not
butyrate, trapoxin, or trichostatin. 

In another aspect, the invention provides an affinity matrix for binding or 
purifying an HDx. In a preferred embodiment, the affinity matrix can be represented by 
the formula S-A-B-C, in which S is a solid or insoluble support, and A, B, and C are as 
described above. The solid or insoluble support S can be any of a variety of supports,
many of which are known in the art, for synthesis of, or immobilization of, compounds,
e.g., peptides, benzodiazepines, and the like. For a review of solid-supported synthesis,
see, e.g., Hodge et al., Polymer-supported Reactions in Organic Synthesis, John Wiley
& Sons, New York, 1980. The HDx inhibitor moiety A-B-C can be bonded directly to
the support S, or can be bonded to the support S through a linking or spacing moiety, as
is known in the art.

In another aspect, the invention provides a method of inhibiting an HDx. The
method comprises contacting the HDx with a compound capable of inhibiting HDx
activity, under conditions such that HDx activity is inhibited. In preferred embodiments,
the compounds can be represented by the formula A-B-C, in which A, B, and C are as
described above; with the proviso that the compound is not butyrate, trapoxin or
trichostatin.

In another aspect, the invention provides a method of purifying an HDx. The
method includes contacting a reaction mixture comprising an HDx with an affinity matrix
capable of selectively binding to an HDx, and separating at least one other component of
the reaction mixture from the HDx. In a preferred embodiment, the affinity matrix can be 
represented by the formula S-A-B-C, in which S, A, B, and C are as described above. 

In general, the elements A, B, and C of the inhibitor compounds are selected to
permit selective binding to, and inhibition of, at least one HDx. The elements A, B, and
C can be selected to provide specificity for particular HDxs. For example, a series of
candidate HDx inhibitor compounds can be synthesized, e.g., according to the
combinatorial methods described infra, and the library of candidate compounds screened
against one or more HDxs to determine the compound or compounds with optimal
activity and specificity for a particular HDx.

Thus, in preferred embodiments, the specificity element A is selected such that the 
HDx inhibitor compound binds selectively to an HDx. In general, the specificity element 
A will be selected according to factors such as the binding specificity of the HDx or HDxs 
to which the inhibitor compound should bind, ease of synthesis, stability in vivo or in 
vitro, and the like. In certain embodiments, the specificity element A is a
cyclotetrapeptidyl moiety. In another embodiment, A is a substituted or unsubstituted
aryl moiety. In yet another embodiment, A is a nonaromatic carbocycle. In still another
embodiment, A is an amino acyl moiety (e.g., a natural or non-natural amino acyl
moiety). In yet another embodiment, A is a heterocyclyl moiety.

In preferred embodiments, B is selected from the group consisting of substituted 
and unsubstituted C4-C8 alkylidene, C4-C8 alkenylidene, C4-C8 alkynylidene, and D-E-F, 
in which D and F are independently absent or C2-C7 alkylidene, C2-C7 alkenylidene, or 
C2-C7 alkynylidene, and E is O, S, or NR', in which R' is H, lower alkyl, lower alkenyl, 
lower alkynyl, aralkyl, aryl, or heterocyclyl. The element B should be selected to permit 
the specificity element A to interact with an HDx such that specific binding occurs, while 
poising the electrophilic moiety C for reaction with a nucleophilic moiety of the HDx.

In a preferred embodiment, C is an electrophilic moiety that is approximately
isosteric with an N-acetyl group (i.e., C has approximately the same steric bulk as an
N-acetyl group). In preferred embodiments, the element C is capable of reacting, covalently
or non-covalently, with a nucleophilic moiety of an HDx. In certain preferred
embodiments, the element C is capable of binding (e.g., by chelation) to a metal ion, e.g.,
a divalent metal ion, e.g., zinc or calcium. In preferred embodiments, C is selected from
the group consisting of α,β-epoxyketones, α,β-epoxythioketones, α,β-epoxysulfoxides,
hydroxamic acids, α-haloketones, α-halothioketones, α-diazoketones,
α-diazothioketones, vinyl epoxides, trifluoromethylketone, trifluoromethylthioketone,
enones (e.g., of ketones or thioketones), ynones (e.g., of ketones or thioketones),
α,β-aziridinoketones, hydrazones, boronic acids, carboxylates, amides (e.g., -C(O)-amino),
sulfones, aldehyde, alkyl halides, epoxides, and the like.

In accordance with the foregoing, the moieties A, B, and C can illustratively be
represented by the formulas depicted in Figure 6, in which R1 represents one or more
substituents selected from the group consisting of amino, halogen, alkyl, alkenyl, alkynyl,
aryl, aralkyl, heterocyclyl, azido, carboxyl, alkoxycarbonyl, hydroxyl, alkoxy, cyano,
trifluoromethyl, and the like; R" is C1-C8 alkylidene, C2-C8 alkenylidene, or C2-C8
alkynylidene; R5 is hydrogen, alkyl, alkoxycarbonyl, aryloxycarbonyl, alkylsulfonyl,
arylsulfonyl or aryl; R6 is hydrogen, alkyl, aryl, alkoxy, aryloxy, halogen, and the like; R'g
is hydrogen, alkyl, alkenyl, alkynyl, aryl, and the like; R7 is hydrogen, alkyl, aryl, alkoxy,
aryloxy, amino, hydroxylamino, alkoxylamino, halogen, and the like; R8 is hydrogen,
alkyl, halogen, and the like; R9 is hydrogen, alkyl, aryl, hydroxyl, alkoxy, aryloxy, amino,
and the like; X is a good leaving group, e.g., diazo, halogen, a sulfate or sulfonate ester,
e.g., a tosylate or mesylate, and the like; and Y is O or S.

In certain preferred embodiments, an HDx inhibitor compound can be represented
by the formula A-B-C, in which A is selected from the group consisting of cycloalkyls,
unsubstituted and substituted aryls, heterocyclyls, amino acyls, and cyclotetrapeptides; B
is selected from the group consisting of substituted and unsubstituted C4-C8 alkylidene,
C4-C8 alkenylidene, C4-C8 alkynylidene, C4-C8 enyne, and D-E-F, in which D and F are
independently absent or a C1-C7 alkylidene, a C2-C7 alkenylidene, or a C2-C7
alkynylidene, and E is O, S, or NR', in which R' represents H, a lower alkyl, a lower
alkenyl, a lower alkynyl, an aralkyl, aryl, or a heterocyclyl; and C is selected from the
group consisting of [chemical structures not reproduced], and B(OH)2 (boronic
acid); in which Z represents O, S, or NR5, and Y, R5, R'g, and R7 are as defined above.
In preferred embodiments, R'g is hydrogen. In certain preferred embodiments, B is not a
C4-C8 alkylidene. In preferred embodiments, if B is a C4-C8 alkylidene, C is not a
boronic acid. In other preferred embodiments, the inhibitor compound is not trapoxin.

In certain preferred embodiments, an HDx inhibitor compound can be represented
by the formula A-B-C, in which A is selected from the group consisting of cycloalkyls,
unsubstituted and substituted aryls, heterocyclyls, amino acyls, and cyclotetrapeptides; B
is selected from the group consisting of substituted and unsubstituted C4-C8 alkylidene,
C4-C8 alkenylidene, C4-C8 alkynylidene, C4-C8 enyne, and D-E-F, in which D and F are
independently absent or C1-C7 alkylidene, C2-C7 alkenylidene, or C2-C7 alkynylidene,
and E is O, S, or NR', in which R' represents H, a lower alkyl, a lower alkenyl, a lower
alkynyl, an aralkyl, an aryl, or a heterocyclyl; and C is selected from the group consisting
of [chemical structures not reproduced], in which R9 is as defined above. In preferred
embodiments, B is not a C4-C8 alkylidene. In preferred embodiments, the inhibitor
compound is not trichostatin.

In still another preferred embodiment, an HDx inhibitor compound can be 
represented by the formula A-B-C, in which A is selected from the group consisting of 
cycloalkyls, unsubstituted and substituted aryls, heterocyclyls, amino acyls, and 
cyclotetrapeptides; B is selected from the group consisting of substituted and 
unsubstituted C4-C8 alkylidene, C4-C8 alkenylidene, C4-C8 alkynylidene, C4-C8 enyne, 
and D-E-F, in which D and F are independently absent or a C1-C7 alkylidene, a C2-C7 
alkenylidene, or a C2-C7 alkynylidene, and E is O, S, or NR', in which R' is H, lower
alkyl, lower alkenyl, lower alkynyl, aralkyl, aryl, or heterocyclyl; and C is
[chemical structure not reproduced], in
which Y is O or S, and R7 is as defined above.

Certain HDx inhibitor compounds of the present invention may exist in particular 
geometric or stereoisomeric forms. For example, amino acids can contain at least one 
chiral center. The present invention contemplates all such compounds, including cis- and 
trans-isomers, R- and S-enantiomers, diastereomers, the racemic mixtures thereof, and 
other mixtures thereof, as falling within the scope of the invention. Additional asymmetric 
carbon atoms may be present in a substituent such as an alkyl group. All such isomers, as 
well as mixtures thereof, are intended to be included in this invention. 

If, for instance, a particular enantiomer of a compound of the present invention is 
desired, it may be prepared by asymmetric synthesis, or by derivation with a chiral 
auxiliary, where the resulting diastereomeric mixture is separated and the auxiliary group 
cleaved to provide the pure desired enantiomer. Alternatively, where the molecule 
contains a basic functional group, such as amino, or an acidic functional group, such as 
carboxyl, diastereomeric salts can be formed with an appropriate optically-active acid or 
base, followed by resolution of the diastereomers thus formed by fractional crystallization 
or chromatographic means well known in the art, and subsequent recovery of the pure 
enantiomers. 

The term "alkyl" refers to the radical of saturated aliphatic groups, including 
straight-chain alkyl groups, branched-chain alkyl groups, cycloalkyl (alicyclic) groups, 
alkyl substituted cycloalkyl groups, and cycloalkyl substituted alkyl groups. In preferred 
embodiments, a straight chain or branched chain alkyl has 30 or fewer carbon atoms in its 
backbone (e.g., C1-C30 for straight chain, C3-C30 for branched chain), and more 
preferably 20 or fewer. Likewise, preferred cycloalkyls have from 4-10 carbon atoms in 
their ring structure, and more preferably have 5, 6 or 7 carbons in the ring structure. 

Unless the number of carbons is otherwise specified, "lower alkyl" as used herein 
means an alkyl group, as defined above, but having from one to ten carbons, more 
preferably from one to six carbon atoms in its backbone structure. Likewise, "lower 
alkenyl" and "lower alkynyl" have similar chain lengths. Preferred alkyl groups are lower 
alkyls. In preferred embodiments, a substituent designated herein as alkyl is a lower 
alkyl. 

Moreover, the term "alkyl" (or "lower alkyl") as used throughout the specification 
and claims is intended to include both "unsubstituted alkyls" and "substituted alkyls", the 
latter of which refers to alkyl moieties having substituents replacing a hydrogen on one or 
more carbons of the hydrocarbon backbone. Such substituents can include, for example, 
halogen, hydroxyl, carbonyl (such as a carboxylate, alkoxycarbonyl, aryloxycarbonyl, 
alkylcarbonyl, arylcarbonyl, aldehyde, and the like), thiocarbonyl (such as a thioacid, 
alkoxycarbonyl, and the like), an alkoxyl, unsubstituted amino, mono- or disubstituted 
amino, amido, amidine, imine, nitro, azido, sulfhydryl, alkylthio, cyano, trifluoromethyl, 
sulfonate, sulfamoyl, sulfonamide, heterocyclyl, aralkyl, or an aromatic or heteroaromatic 
moiety. It will be understood by those skilled in the art that the moieties substituted on 
the hydrocarbon chain can themselves be substituted, as described above, if appropriate. 
Exemplary substituted alkyls are described below. Cycloalkyls can be further substituted 
with, e.g., alkyls, alkenyls, alkoxys, alkylthios, aminoalkyls, carbonyl-substituted alkyls, 
-CF3, -CN, and the like. 

The terms "alkenyl" and "alkynyl" refer to unsaturated aliphatic groups analogous 
in length and possible substitution to the alkyls described above, but that contain at least 
one double or triple bond respectively. The term "enyne" refers to an unsaturated 
aliphatic moiety having at least one double bond and one triple bond. 

The terms "alkylidene," "alkenylidene," and "alkynylidene" are art-recognized and 
refer to moieties corresponding to alkyl, alkenyl, and alkynyl moieties as defined above, 
but having two valences available for bonding. 

The term "aryl" as used herein includes 5-, 6- and 7-membered single-ring 
aromatic groups that may include from zero to four heteroatoms, for example, phenyl, 
pyrrolyl, furanyl, thiophenyl, imidazolyl, oxazolyl, thiazolyl, triazolyl, pyrazolyl, pyridyl, 
pyrazinyl, pyridazinyl and pyrimidyl, and the like. Those aryl groups having heteroatoms 
in the ring structure may also be referred to as "aryl heterocycles" or "heteroaromatics". 
The aromatic ring can be substituted at one or more ring positions with such substituents 
as described above, as for example, halogen, azido, alkyl, aralkyl, alkenyl, alkynyl, 
cycloalkyl, hydroxyl, amino, nitro, sulfhydryl, imino, amido, carbonyl, carboxyl, silyl, 
ether, alkylthio, sulfonyl, sulfonamide, ketone, aldehyde, ester, a heterocyclyl, an 
aromatic or heteroaromatic moiety, -CF3, -CN, or the like. 

The term "aralkyl", as used herein, refers to an alkyl group substituted with an 
aryl group (e.g., an aromatic or heteroaromatic group). 

The terms "heterocyclyl" or "heterocyclic group" refer to non-aromatic 4- to 10- 
membered ring structures, more preferably 4- to 7-membered rings, which ring structures 
include one to four heteroatoms (e.g., O, N, S, P and the like). Heterocyclyl groups 
include, for example, pyrrolidine, oxolane, thiolane, imidazole, oxazole, piperidine, 
piperazine, morpholine, lactones, lactams such as azetidinones and pyrrolidinones, 
sultams, sultones, and the like. The heterocyclic ring can be substituted at one or more 
positions with such substituents as described above, as for example, halogen, alkyl, 
aralkyl, alkenyl, alkynyl, cycloalkyl, hydroxyl, amino, nitro, sulfhydryl, imino, amido, 
alkoxycarbonyl, aryloxycarbonyl, carboxyl, silyl, ether, alkylthio, alkylsulfonyl, 
arylsulfonyl, ketone (e.g., -C(O)-alkyl or -C(O)-aryl), aldehyde, heterocyclyl, an aryl or 
heteroaryl moiety, -CF3, -CN, or the like. 

Compounds represented by the formula A-B-C, in which A, B, and C have the 
values described supra, can be synthesized by standard techniques of organic synthesis. 
For example, precursor synthons corresponding to each of the moieties A, B, and C, or 
subunits thereof, can be coupled in linear or convergent syntheses to provide HDx 
inhibitor compounds, or compounds readily converted thereto. Syntheses of the HDx 
inhibitor compound trichostatin, and related compounds, have been reported; see, e.g., 
Massa, S. et al. (1990) J. Med. Chem. 33:2845-49; Mori, K. and Koseki, K. (1988) 
Tetrahedron 44:6013-20; Koseki, K. and Mori, K. European Patent Application EP 
331524 A2; Fleming, I. et al. (1983) Tetrahedron 39:841-46. Analogs of trapoxin have 
also been reported; see, e.g., [authors illegible in source] (1992) J. Cancer Res. 83:324-27. 

Thus, in an illustrative synthesis, a compound represented by the formula A-B-C, 
in which A is a phenyl group, while B and C can have a variety of values, can be 
synthesized as shown below: 

[Reaction scheme not reproduced in this text; the one recoverable label reads "acylation".] 

Scheme I 

According to the Scheme, a functionalized organometallic aryl compound 
(M = organometallic moiety; R is any substituent; X is a leaving group, e.g., halogen) 
(e.g., organotin, boronate, aryllithium, cuprate, Grignard reagent, etc.) is alkylated or 
acylated to provide functionalized compounds (e.g., the exemplary compounds 1, 2, or 3), 
which can be further elaborated to provide compounds with a wide variety of substituents 
and carbon backbones. Other A moieties (e.g., specificity elements) can be obtained by 
use of appropriate synthons, e.g., by substituting vinylorganometallic compounds for the 
organometallic aryl compound of the Scheme (followed by further treatment, e.g., 
reduction, of the vinyl group, if desired, to yield an alkyl A moiety). By way of 
illustration, as shown for compound 1, the carbonyl group can be used for elaboration, 
e.g., by reduction of the carbonyl group to an alcohol, conversion of the alcohol to a 
tosylate, and nucleophilic displacement of the tosylate by an acyl compound (e.g., a 
ketone or ester) to provide a chain-lengthened product (Route A), which can be 
converted to a C(O)X functionality (e.g., by hydrolysis of an ester and conversion of the 
resulting carboxylic acid to an acid chloride). Alternatively, the carbonyl group of 1 can 
be used for olefination (Route B), e.g., Horner-Emmons olefination, to provide an 
elaborated alkenyl compound. Also, the carbonyl group can be converted to an alkynyl 
functionality, e.g., via the Corey-Fuchs procedure, to provide an elaborated alkynyl 
compound. For purposes of clarity, only certain chain lengths and functional group 
patterns are shown in the scheme; however, the skilled artisan will appreciate that many 
other compounds, with a variety of B moieties (i.e., linking moieties), can be synthesized 
through analogous procedures. The C(O)X functionality (e.g., an acid chloride where X 
is Cl) can be converted to functional groups such as amide, hydrazido, 
trifluoromethylketone, enone, epoxide, aziridine, and the like, through methods 
conventional in the art. Thus, the synthetic pathways shown in the Scheme provide 
access to compounds having a variety of C moieties (e.g., reactive moieties) suitable for 
substitution in the subject HDx inhibitors. 

In vitro chemical synthesis provides a method for generating libraries of 
compounds that can be screened for ability to bind to or inhibit a target protein, e.g., an 
HDx. Although in vitro methods have previously been used in the pharmaceutical 
industry to identify potential drugs, recently developed methods have focused on rapidly 
and efficiently generating and screening large numbers of compounds and are amenable to 
generating HDx inhibitor compound libraries for use in the subject method. The various 
approaches to simultaneous preparation and analysis of large numbers of compounds 
(herein "combinatorial synthesis") each rely on the fundamental concept of synthesis on a 
solid support introduced for peptides by Merrifield in 1963 (Merrifield, R.B. (1963) J Am 
Chem Soc 85:2149-2154). Many types of solid matrices have been successfully used in 
solid-phase synthesis, and can be selected according to the type of chemistry to be 
performed on the immobilized moieties, as is discussed in more detail below. 

Several synthetic schemes have been suggested or employed for the combinatorial 
synthesis of organic compounds (see, e.g., E.M. Gordon et al., J. Med. Chem. 37:1385-1401 
(1994)). 

Multipin Synthesis 

One method for combinatorial synthesis of compounds is the multipin synthesis 
method. Briefly, Geysen and co-workers (Geysen et al. (1984) PNAS 81:3998-4002) 
introduced a method for generating compounds by a parallel synthesis on polyacrylic 
acid-grafted polyethylene pins arrayed in the microtitre plate format. In the original 
experiments, about 50 nmol of a single compound was covalently linked to the spherical 
head of each pin, and interactions of each compound with a receptor or antibody could be 
determined in a direct binding assay. The Geysen technique can be used to synthesize 
and screen thousands of compounds per week using the multipin method, and the 
tethered compounds may be reused in many assays. In subsequent work, the level of 
compound loading on individual pins has been increased to as much as 2 μmol/pin by 
grafting greater amounts of functionalized acrylate derivatives to detachable pin heads, 
and the size of the compound library has been increased (Valerio et al. (1993) Int J Pept 
Protein Res 42:1-9). Appropriate linker moieties have also been appended to the pins so 
that the compounds may be cleaved from the supports after synthesis for assessment of 
purity and evaluation in competition binding or functional bioassays (Bray et al. (1990) 
Tetrahedron Lett 31:5811-5814; Valerio et al. (1991) Anal Biochem 197:168-177; Bray 
et al. (1991) Tetrahedron Lett 32:6163-6166). 

More recent applications of the multipin method have taken advantage of the 
cleavable linker strategy to prepare soluble compound libraries (Maeji et al. (1990) J 
Immunol Methods 134:23-33; Gammon et al. (1991) J Exp Med 173:609-617; Mutch et 
al. (1991) Pept Res 4:132-137). 

Divide-Couple-Recombine 

In another embodiment, a variegated library of HDx inhibitor compounds is 
provided on a set of beads utilizing the strategy of divide-couple-recombine (see, e.g., 
Houghten (1985) PNAS 82:5131-5135; and U.S. Patents 4,631,211; 5,440,016; 
5,480,971). Briefly, as the name implies, at each synthesis step where degeneracy (e.g., a 
plurality of different moieties) is introduced into the library, the beads are divided into as 
many separate groups as there are different residues (e.g., functional 
groups or other moieties) to be added at that position, the different residues are coupled in 
separate reactions, and the beads are recombined into one pool for the next step. 
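
The divide-couple-recombine cycle described above can be illustrated with a small simulation. This is only a sketch under stated assumptions: the residue labels (A1, B1, and so on), the bead count, and the function name are hypothetical and do not come from the cited references. It shows that each bead ends up carrying a single compound drawn from the full set of combinations.

```python
import random
from itertools import product

def divide_couple_recombine(n_beads, residues_per_step, seed=0):
    """Simulate split synthesis; return one compound (tuple of residues) per bead."""
    rng = random.Random(seed)
    beads = [() for _ in range(n_beads)]      # every bead starts with an empty chain
    for residues in residues_per_step:
        rng.shuffle(beads)                    # the recombined pool is mixed
        # Divide: one group of beads per residue to be added at this position
        groups = [beads[i::len(residues)] for i in range(len(residues))]
        # Couple: each group reacts with exactly one residue, in its own vessel
        coupled = [[chain + (r,) for chain in g] for r, g in zip(residues, groups)]
        # Recombine: pool all groups for the next step
        beads = [chain for g in coupled for chain in g]
    return beads

# Hypothetical residue labels for three synthesis steps
steps = [["A1", "A2", "A3"], ["B1", "B2"], ["C1", "C2", "C3", "C4"]]
library = divide_couple_recombine(n_beads=2400, residues_per_step=steps)
possible = set(product(*steps))

print(len(possible))                  # number of distinct compounds that can form
print(len(set(library)))              # number actually represented on the beads
```

Only 3 + 2 + 4 = 9 coupling reactions are run, yet the pool can contain up to 3 x 2 x 4 = 24 different compounds, which is the practical appeal of the strategy.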

In one embodiment, the divide-couple-recombine strategy can be carried out 
using the so-called "tea bag" method first developed by Houghten, where synthesis 
occurs on resin that is sealed inside porous polypropylene bags (Houghten (1985) 
PNAS 82:5131-5135). Residues are coupled to the resins by placing the bags in solutions 
of the appropriate individual activated monomers, while all common steps such as resin 
washing and deprotection (if appropriate) are performed simultaneously in one reaction 
vessel. At the end of the synthesis, each bag contains a single compound, and the 
compounds may be liberated from the resins using a multiple cleavage apparatus 
(Houghten et al. (1986) Int J Pept Protein Res 21:613-61[...]). This technique offers 
advantages of considerable synthetic flexibility and has been partially automated (Beck- 
Sickinger et al. (1991) Pept Res 4:88-94). Moreover, compounds can be produced in 
sufficient quantities (> 500 μmol) for purification and complete characterization if 
desired. 

Synthesis using the tea-bag approach is useful for the production of a library, 
albeit of limited size, as is illustrated by its use in a range of molecular recognition 
problems including antibody epitope analysis (Houghten (1985) PNAS 82:5131- 
5135), peptide hormone structure-function studies (Beck-Sickinger et al. (1990) Int J 
Pept Protein Res 36:522-530; Beck-Sickinger et al. (1990) Eur J Biochem 194:449-456), 
and protein conformational mapping (Zimmerman et al. (1991) Eur J Biochem 200:519- 
528). 

Combinatorial Synthesis on Nontraditional Solid Supports 

The search for innovative methods of solid-phase synthesis has led to the 
investigation of alternative polymeric supports to the polystyrene-divinylbenzene matrix 
originally popularized by Merrifield. Cellulose, either in the form of paper disks 
(Blankemeyer-Menge et al. (1988) Tetrahedron Lett 29:5871-5874; Frank et al. (1988) 
Tetrahedron 44:6031-6040; Eichler et al. (1989) Collect Czech Chem Commun 54:1746- 
1752; Frank, R. (1993) Bioorg Med Chem Lett 3:425-430) or cotton fragments (Eichler 
et al. (1991) Pept Res 4:296-307; Schmidt et al. (1993) Bioorg Med Chem Lett 3:441- 
446) has been successfully functionalized for peptide synthesis. Typical loadings attained 
with cellulose paper range from 1 to 3 nmol/cm^2, and HPLC analysis of material cleaved 
from these supports indicates a reasonable quality for the synthesized peptides. 
Alternatively, peptides may be synthesized on cellulose sheets via non-cleavable linkers 
and then used in ELISA-based binding studies (Frank, R. (1992) Tetrahedron 48:9217- 
9232). The porous, polar nature of this support may help suppress unwanted nonspecific 
protein binding effects. In one convenient configuration, synthesis occurs in an 8 x 12 
microtiter plate format. Frank has used this technique to map the dominant epitopes of 
an antiserum raised against a human cytomegalovirus protein, following the overlapping 
peptide screening (Pepscan) strategy of Geysen (Frank, R. (1992) Tetrahedron 48:9217- 
9232). Other membrane-like supports that may be used for solid-phase synthesis include 
polystyrene-grafted polyethylene films (Berg et al. (1989) J Am Chem Soc 111:8024- 
8026). 

Combinatorial Libraries by Light-Directed Spatially Addressable Parallel Chemical 
Synthesis 

A scheme of combinatorial synthesis in which the identity of a compound is given 
by its location on a synthesis substrate is termed a spatially-addressable synthesis. In one 
embodiment, the combinatorial process is carried out by controlling the addition of a 
chemical reagent to specific locations on a solid support (Dower et al. (1991) Annu Rep 
Med Chem 26:271-280; Fodor, S.P.A. (1991) Science 251:767; Pirrung et al. (1992) 
U.S. Patent No. 5,143,854; Jacobs et al. (1994) Trends Biotechnol 12:19-26). The 
technique combines two well-developed technologies: solid-phase synthesis chemistry 
and photolithography. The high coupling yields of solid-phase reactions allow efficient 
compound synthesis, and the spatial resolution of photolithography affords 
miniaturization. The merging of these two technologies is done through the use of 
photolabile protecting groups, e.g., amino protecting groups, in the synthetic procedure. 

The key points of this technology are illustrated in Gallop et al. (1994) J Med 
Chem 37:1233-1251. A synthesis substrate is prepared for compound synthesis through 
the covalent attachment of photolabile nitroveratryloxycarbonyl (NVOC) protected 
amino linkers. Light is used to selectively activate a specified region of the synthesis 
support for coupling. Removal of the photolabile protecting groups by light 
(deprotection) results in activation of selected areas. After activation, the first of a set of 
residues, each bearing a photolabile protecting group, is exposed to the entire surface. 
Coupling only occurs in regions that were addressed by light in the preceding step. The 
reagent solution is removed, and the substrate is again illuminated through a second 
mask, activating a different region for reaction with a second protected building block. 
The pattern of masks and the sequence of reactants define the products and their 
locations. Since this process utilizes photolithography techniques, the number of 
compounds that can be synthesized is limited only by the number of synthesis sites that 
can be addressed with appropriate resolution. The position of each compound is 
precisely known; hence, its interactions with other molecules can be directly assessed. 
The target can be labeled with a fluorescent reporter group to facilitate the identification 
of specific interactions with individual members of the matrix. 

In a light-directed chemical synthesis, the products depend on the pattern of 
illumination and on the order of addition of reactants. By varying the lithographic 
patterns, many different sets of test compounds can be synthesized in the same number of 
steps; this leads to the generation of many different masking strategies. 
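
The dependence of the products on the mask pattern and on the order of reactant addition can be sketched in a few lines. The example below is an illustration only: the building-block labels X, Y, and Z, the one-dimensional row of sites, and the "binary" masking pattern are assumptions chosen for clarity and are not taken from the cited references.

```python
def light_directed_synthesis(n_sites, steps):
    """Return the product at each site.

    steps is a list of (mask, building_block) pairs, where mask is the set of
    site indices illuminated (deprotected) before that building block is added.
    """
    sites = [[] for _ in range(n_sites)]
    for mask, block in steps:
        for i in mask:                 # coupling occurs only at illuminated sites
            sites[i].append(block)
    return ["-".join(s) if s else "(linker only)" for s in sites]

blocks = ["X", "Y", "Z"]               # hypothetical building blocks, in order of addition
n_sites = 2 ** len(blocks)

# Assumed masking strategy: step k illuminates every site whose k-th bit is set
steps = [({i for i in range(n_sites) if (i >> k) & 1}, b) for k, b in enumerate(blocks)]

products = light_directed_synthesis(n_sites, steps)
for position, compound in enumerate(products):
    print(position, compound)
```

With this masking pattern, three coupling steps yield eight distinct, positionally defined products; a different set of masks applied to the same three reactants would give a different set of compounds.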

Encoded Combinatorial Libraries 

In yet another embodiment, the subject method provides an HDx inhibitor 
compound library provided with an encoded tagging system. A recent improvement in 
the identification of active compounds from combinatorial libraries employs chemical 
indexing systems using tags that uniquely encode the reaction steps a given bead has 
undergone and, by inference, the structure it carries. Conceptually, this approach mimics 
phage display libraries, where activity derives from expressed peptides, but the structures 
of the active peptides are deduced from the corresponding genomic DNA sequence. The 
first encoding of synthetic combinatorial libraries employed DNA as the code. Two forms 
of encoding have been reported: encoding with sequenceable bio-oligomers (e.g., 
oligonucleotides and peptides), and binary encoding with non-sequenceable tags. 

Tagging with sequenceable bio-oligomers 

The principle of using oligonucleotides to encode combinatorial synthetic libraries 
was described in 1992 (Brenner et al. (1992) PNAS 89:5381-5383), and an example of 
such a library appeared the following year (Needles et al. (1993) PNAS 90:10700-10704). 
A combinatorial library of nominally 7^7 (= 823,543) peptides composed of all 
combinations of Arg, Gln, Phe, Lys, Val, D-Val and Thr (three-letter amino acid code), 
each of which was encoded by a specific dinucleotide (TA, TC, CT, AT, TT, CA and 
AC, respectively), was prepared by a series of alternating rounds of peptide and 
oligonucleotide synthesis on solid support. In this work, the amine linking functionality 
on the bead was specifically differentiated toward peptide or oligonucleotide synthesis by 
simultaneously preincubating the beads with reagents that generate protected OH groups 
for oligonucleotide synthesis and protected NH2 groups for peptide synthesis (here, in a 
ratio of 1:20). When complete, each tag consisted of a 69-mer, 14 units of which 
carried the code. The bead-bound library was incubated with a fluorescently labeled 
antibody, and beads containing bound antibody that fluoresced strongly were harvested 
by fluorescence-activated cell sorting (FACS). The DNA tags were amplified by PCR and 
sequenced, and the predicted peptides were synthesized. Following such techniques, 
HDx inhibitor compound libraries can be derived and screened using HDx of the subject 
invention. 
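
The dinucleotide code given above lends itself to a short worked example. The sketch below uses the residue-to-dinucleotide assignments stated in the text; the assumption that the 14 coding units are read contiguously, in the same order as the peptide residues, is ours, since the layout of the coding region within the 69-mer is not described here. The function names are hypothetical.

```python
# Residue-to-dinucleotide assignments as listed in the text
CODE = {
    "TA": "Arg", "TC": "Gln", "CT": "Phe", "AT": "Lys",
    "TT": "Val", "CA": "D-Val", "AC": "Thr",
}
INVERSE = {residue: dinucleotide for dinucleotide, residue in CODE.items()}

def encode_peptide(residues):
    """Return the coding region (assumed contiguous) for a list of residues."""
    return "".join(INVERSE[r] for r in residues)

def decode_tag(coding_region):
    """Read a coding region two bases at a time and return the residues."""
    if len(coding_region) % 2:
        raise ValueError("coding region must contain whole dinucleotides")
    return [CODE[coding_region[i:i + 2]] for i in range(0, len(coding_region), 2)]

peptide = ["Arg", "Gln", "Phe", "Lys", "Val", "D-Val", "Thr"]
tag = encode_peptide(peptide)
print(tag, len(tag))          # a 14-base coding region for a 7-residue peptide
print(decode_tag(tag))
print(7 ** 7)                 # nominal library size
```

Seven positions at two bases each account for the 14 coding units mentioned above, and seven choices at each of seven positions give the nominal library size of 823,543.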

It is noted that an alternative approach useful for generating nucleotide-encoded 
synthetic peptide libraries employs a branched linker containing selectively protected OH 
and NH2 groups (Nielsen et al. (1993) J Am Chem Soc 115:9812-9813; and Nielsen et al. 
(1994) Methods Compan Methods Enzymol 6:361-371). This approach requires that 
equimolar quantities of test peptide and tag co-exist, though this may be a potential 
complication in assessing biological activity, especially with nucleic acid based targets. 

The use of oligonucleotide tags permits exquisitely sensitive tag analysis. Even so, 
the method requires careful choice of orthogonal sets of protecting groups required for 
alternating co-synthesis of the tag and the library member. Furthermore, the chemical 
lability of the tag, particularly the phosphate and sugar anomeric linkages, may limit the 
choice of reagents and conditions that can be employed for the synthesis of 
non-oligomeric libraries. In preferred embodiments, the libraries employ linkers permitting 
selective detachment of the test HDx inhibitor compound library member for bioassay, in 
part (as described infra) because assays employing beads limit the choice of targets, and 
in part because the tags are potentially susceptible to biodegradation. 

Peptides themselves have been employed as tagging molecules for combinatorial 
libraries. Two exemplary approaches are described in the art, both of which employ 
branched linkers to solid phase upon which coding and ligand strands are alternately 
elaborated. In the first approach (Kerr JM et al. (1993) J Am Chem Soc 115:2529-2531), 
orthogonality in synthesis is achieved by employing acid-labile protection for the coding 
strand and base-labile protection for the ligand strand. 

In an alternative approach (Nikolaiev et al. (1992) Pept Res 6:161-170), branched 
linkers are employed so that the coding unit and the test peptide are both attached to the 
same functional group on the resin. In one embodiment, a linker can be placed between 
the branch point and the bead so that cleavage releases a molecule containing both code 
and ligand (Ptek et al. (1991) Tetrahedron Lett 32:3891-3894). In another embodiment, 
the linker can be placed so that the test peptide can be selectively separated from the 
bead, leaving the code behind. This last construct is particularly valuable because it 
permits screening of the test peptide without potential interference, or biodegradation, of 
the coding groups. Examples in the art of independent cleavage and sequencing of 
peptide library members and their corresponding tags have confirmed that the tags can 
accurately predict the peptide structure. 

It is noted that peptide tags are more resistant to decomposition during ligand 
synthesis than are oligonucleotide tags, but they must be employed in molar ratios nearly 
equal to those of the ligand on typical 130 μm beads in order to be successfully 
sequenced. As with oligonucleotide encoding, the use of peptides as tags requires 
complex protection/deprotection chemistries. 

Non-sequenceable tagging: binary encoding 

An alternative form of encoding the test peptide library employs a set of non- 
sequenceable tagging molecules (e.g., molecules having electrophoric moieties) that are 
used as a binary code (Ohlmeyer et al. (1993) PNAS 90:10922-10926). Exemplary tags 
are haloaromatic alkyl ethers that are detectable as their trimethylsilyl ethers at less than 
femtomolar levels by electron capture gas chromatography (ECGC). Variations in the 
length of the alkyl chain, as well as the nature and position of the aromatic halide 
substituents, permit the synthesis of at least 40 such tags, which in principle can encode 
2^40 (e.g., upwards of 10^12) different molecules. In the original report (Ohlmeyer et al., 
supra) the tags were bound to about 1% of the available amine groups of a peptide 
library via a photocleavable o-nitrobenzyl linker. This approach is convenient when 
preparing combinatorial libraries of peptides or other amine-containing molecules. A 
more versatile system has, however, been developed that permits encoding of essentially 
any combinatorial library. Here, the ligand is attached to the solid support via the 
photocleavable linker and the tag is attached through a catechol ether linker via carbene 
insertion into the bead matrix (Nestler et al. (1994) J Org Chem 59:4723-4724). This 
orthogonal attachment strategy permits the selective detachment of library members for 
bioassay in solution and subsequent decoding by ECGC after oxidative detachment of the 
tag sets. 
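
The way a set of tags acts as a binary code can be made concrete with a small example. This is a minimal sketch, not the scheme of the cited report: the allocation of a fixed block of tags to each synthesis step, the reservation of the all-zero code, and all function names are assumptions made for illustration. The presence or absence of each tag on a bead is read as one bit.

```python
def build_layout(blocks_per_step):
    """Assign each synthesis step a contiguous range of tags (offset, width)."""
    layout, offset = [], 0
    for n in blocks_per_step:
        width = n.bit_length()        # enough bits for codes 1..n; 0 is left unused
        layout.append((offset, width))
        offset += width
    return layout, offset             # offset is the total number of tags required

def encode(choices, layout):
    """choices[i] is the 1-based building block used at step i; return tags on the bead."""
    tags = set()
    for choice, (offset, width) in zip(choices, layout):
        for bit in range(width):
            if (choice >> bit) & 1:   # tag present means this bit is 1
                tags.add(offset + bit)
    return tags

def decode(tags, layout):
    """Recover the building block used at each step from the detected tag set."""
    return [
        sum(1 << bit for bit in range(width) if offset + bit in tags)
        for offset, width in layout
    ]

layout, n_tags = build_layout([7, 7, 7])   # three steps, seven building blocks each
bead_tags = encode([5, 1, 7], layout)
print(n_tags, sorted(bead_tags))
print(decode(bead_tags, layout))
print(2 ** 40 > 10 ** 12)                   # 40 tags exceed 10^12 combinations
```

Nine tags suffice to record three steps with seven choices each, and the same arithmetic shows why 40 tags can in principle distinguish more than 10^12 molecules.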

Binary encoding with tags, e.g., electrophoric tags, has been particularly useful in 
defining selective interactions of substrates with synthetic receptors (Borchardt et al. 
(1994) J Am Chem Soc 116:373-374), and model systems for understanding the binding 
and catalysis of biomolecules. Even using detailed molecular modeling, the identification 
of the selectivity preferences for synthetic receptors has required the manual synthesis of 
dozens of potential substrates. The use of encoded libraries makes it possible to rapidly 
examine all the members of a potential binding set. The use of binary-encoded libraries 
has made the determination of binding selectivities so facile that structural selectivity has 
been reported for four novel synthetic macrobicyclic and tricyclic receptors in a single 
communication (Wennemers et al. (1995) J Org Chem 60:1108-1109; and Yoon et al. 
(1994) Tetrahedron Lett 35:8557-8560) using the encoded library mentioned above. 
Similar facility in defining specificity of interaction would be expected for many other 
biomolecules. 

Although the several amide-linked libraries in the art employ binary encoding with 
the electrophoric tags attached to amine groups, attaching these tags directly to the bead 
matrix provides far greater versatility in the structures that can be prepared in encoded 
combinatorial libraries. Attached in this way, the tags and their linker are nearly as 
unreactive as the bead matrix itself. Two binary-encoded combinatorial libraries have 
been reported where the tags are attached directly to the solid phase (Ohlmeyer et al. 
(1995) PNAS 92:6027-6031) and provide guidance for generating the subject HDx 
inhibitor compound library. Both libraries were constructed using an orthogonal 
attachment strategy in which the library member was linked to the solid support by a 
photolabile linker and the tags were attached through a linker cleavable only by vigorous 
oxidation. Because the library members can be repetitively partially photoeluted from the 
solid support, library members can be utilized in multiple assays. Successive photoelution 
also permits a very high throughput iterative screening strategy: first, multiple beads are 
placed in 96-well microtiter plates; second, ligands are partially detached and transferred 
to assay plates; third, a bioassay identifies the active wells; fourth, the corresponding 
beads are rearrayed singly into new microtiter plates; fifth, single active compounds are 
identified; and, sixth, the structures are decoded. 
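
The six-step iterative screening strategy just described can be sketched as a simple simulation that also counts how many assays are needed. The bead identifiers, the pool size of ten beads per well, and the is_active callback standing in for the bioassay are hypothetical; the sketch also assumes an idealized assay in which a well is active whenever at least one of its beads carries an active compound.

```python
def iterative_screen(beads, is_active, beads_per_well):
    """Return (active beads, number of assays run) for a two-round pooled screen."""
    # Step 1: place multiple beads in each well
    wells = [beads[i:i + beads_per_well] for i in range(0, len(beads), beads_per_well)]
    # Steps 2-3: partially elute ligands and assay each well as a pool
    active_wells = [w for w in wells if any(is_active(b) for b in w)]
    # Step 4: re-array the beads from active wells singly
    singles = [b for w in active_wells for b in w]
    # Step 5: elute again and assay each bead on its own
    hits = [b for b in singles if is_active(b)]
    # Step 6 (decoding the tags on each hit) is outside this sketch
    return hits, len(wells) + len(singles)

beads = list(range(960))            # hypothetical bead identifiers
actives = {17, 503}                 # hypothetical active compounds
hits, assays = iterative_screen(beads, lambda b: b in actives, beads_per_well=10)
print(hits, assays)
```

Under these assumptions, 960 beads are resolved with 116 assays rather than 960, which illustrates why repeated partial photoelution supports high-throughput screening.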

The above approach was employed in screening for carbonic anhydrase (CA) 
binding and identified compounds which exhibited nanomolar affinities for CA. Unlike 
sequenceable tagging, a large number of structures can be rapidly decoded from binary- 
encoded libraries (a single ECGC apparatus can decode 50 structures per day). Thus, 
binary-encoded libraries can be used for the rapid analysis of structure-activity 
relationships and optimization of both potency and selectivity of an active series. The 
synthesis and screening of large unbiased binary encoded HDx inhibitor compound 
libraries for lead identification, followed by preparation and analysis of smaller focused 
libraries for lead optimization, offers a particularly powerful approach to discovery of 
HDx inhibitor compounds. 

HDx inhibitor compounds can be synthesized on solid support by appropriate 
functionalization for attachment to a solid matrix, or alternatively, by solution-phase 
synthesis followed by immobilization through an appropriate functional group. Thus, in 
an illustrative embodiment, an HDx inhibitor compound, which is analogous to 
trichostatin, can be synthesized on a solid support by attachment through an amino group 
of the specificity element A, as shown in Figure 7. The solid support is preferably 
capable of withstanding synthetic conditions required to synthesize the requisite 
compounds. The compound can preferably be released from the solid support, e.g., by 
selective cleavage of an amide bond. 

The synthetic steps employed to synthesize compounds on solid support are 
preferably selected to allow a wide variety of residues (e.g., building blocks) to be 
coupled to the immobilized moieties, preferably under mild conditions. Suitable reaction 
chemistries include well-known carbon-carbon bond forming reactions such as the Stille 
and Suzuki couplings, as well as Horner-Emmons reactions, Ni/Cr mediated couplings, 
and the like. Particularly preferred coupling reactions can be performed in the presence 
of water and do not require harsh conditions or expensive reagents. 

Thus, in an exemplary synthesis shovra in Figure 7, substituted N-methyi-4- 
(tributyltin)anilines (in which Ri represents one or more substitutions, e.g., hydrogen, 
halogen, alkyl, alkoxy, and the like) are coupled in a plurality of reaction vessels to beads 
of a solid support (e.g., Affigel). The beads are further divided into a plurality of reaction 
vessels, and suspended in a solvent such as DMF, and one acid chloride building block 
(corresponding to linking element B) is introduced into each vessel (R2 and R3 represent, 
e.g., hydrogen, halogen, alkyl, and the like; and the broken line represents an optional
double bond). The reactions are stirred under an inert gas (e.g., nitrogen) and a palladium
catalyst (e.g., Pd(PPh3)4) is added (0.1-1.0 mol%). The reaction is stirred for 1-24
hours. Upon completion of the reaction, the beads are washed, and placed in a plurality
of vessels. The aldehyde moiety is deprotected by mild acid treatment (e.g., PPTS in
MeOH), and the beads are again washed and placed in a plurality of reaction vessels, and 
the beads are suspended in dry acetonitrile. One building block (corresponding to the 
reactive element C) is then added to each reaction vessel. As illustratively shown in 
Figure 7, a plurality of phosphonates can be employed (R4 represents, e.g., alkyl, alkenyl,
alkynyl, alkoxy, and the like). A Horner-Emmons reaction is performed by addition of
LiCl (1.1 equiv.) and diisopropylethylamine (DIPEA) or DBU (1.2 equiv.). Upon
completion of the reaction, the beads are washed with water and acetonitrile, and then 
dried to yield a library of candidate HDx inhibitor compounds on solid support. The 
compounds can then be released from the solid support into solution; or the compounds 
can be screened while attached to the solid support.
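
The number of candidate compounds produced by such a scheme follows from the number of building blocks used at each step. The short Python sketch below is illustrative only: the building-block labels are hypothetical placeholders, not structures taken from Figure 7, and the code simply enumerates one candidate for each combination of specificity element A, linking element B and reactive element C.

```python
from itertools import product

# Hypothetical placeholder labels for the three classes of building blocks.
# These names are assumptions for illustration, not compounds from Figure 7.
SPECIFICITY_ELEMENTS_A = ["A1", "A2", "A3"]       # substituted anilines (R1 varies)
LINKING_ELEMENTS_B = ["B1", "B2"]                 # acid chlorides (R2, R3 vary)
REACTIVE_ELEMENTS_C = ["C1", "C2", "C3", "C4"]    # phosphonates (R4 varies)


def enumerate_library(a_blocks, b_blocks, c_blocks):
    """Return one (A, B, C) tuple per candidate compound in the library."""
    return list(product(a_blocks, b_blocks, c_blocks))


library = enumerate_library(
    SPECIFICITY_ELEMENTS_A, LINKING_ELEMENTS_B, REACTIVE_ELEMENTS_C
)
print(len(library))   # 3 * 2 * 4 = 24 candidate compounds
print(library[0])     # ('A1', 'B1', 'C1')
```

With three, two and four building blocks at the respective steps, the sketch yields 24 combinations; a parallel synthesis would need one reaction vessel per combination at the final step.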

The above combinatorial synthesis can be performed in an encoded mode, e.g.,
the binary tagging method described supra, by addition of the appropriate tag for each
monomer. In this mode, after each reaction has been performed and the corresponding 
tag attached, the beads from all reactions can be recombined and then divided into 
aliquots for further derivatization. This method provides the advantage of ease of 
handling when large libraries are to be synthesized. Regardless of the method of 
synthesis, the combinatorial library can be screened for activity according to known 
methods (see, e.g., Gordon et al., supra). 
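
The recombine-and-divide procedure with tagging can be pictured with a small simulation. The Python sketch below is a simplified stand-in, not the binary tagging chemistry described supra: it assumes each monomer at a given step is assigned a short binary code, that a bead accumulates one code per step, and that beads are assigned to vessels at random rather than in equal aliquots. All function names and building-block labels are hypothetical.

```python
import random


def bits_needed(n_choices):
    """Bits required for codes 1..n_choices (code 0, 'no tag', is never used)."""
    return max(1, n_choices.bit_length())


def encode(choice_index, width):
    """Binary code for the monomer at choice_index, padded to a fixed width."""
    return format(choice_index + 1, f"0{width}b")


def split_and_pool(steps, n_beads, seed=0):
    """Simulate encoded synthesis: each bead records its monomers and its tags."""
    rng = random.Random(seed)
    beads = [{"compound": [], "tags": ""} for _ in range(n_beads)]
    for monomers in steps:
        width = bits_needed(len(monomers))
        for bead in beads:
            idx = rng.randrange(len(monomers))   # vessel this bead lands in
            bead["compound"].append(monomers[idx])
            bead["tags"] += encode(idx, width)   # tag attached after the reaction
        rng.shuffle(beads)                       # recombine beads before next split
    return beads


def decode(tags, steps):
    """Recover the monomer sequence of a bead from its accumulated tags."""
    compound, pos = [], 0
    for monomers in steps:
        width = bits_needed(len(monomers))
        compound.append(monomers[int(tags[pos:pos + width], 2) - 1])
        pos += width
    return compound


# Hypothetical building-block labels for elements A, B and C.
steps = [["A1", "A2", "A3"], ["B1", "B2"], ["C1", "C2", "C3", "C4"]]
beads = split_and_pool(steps, n_beads=50)
print(beads[0]["tags"], decode(beads[0]["tags"], steps))
```

Because every bead carries its own reaction history in its tags, an active bead found in a screen can be decoded without analyzing the compound itself, which is the handling advantage noted above for large libraries.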

In another aspect, the present invention provides pharmaceutically acceptable
compositions which comprise a therapeutically-effective amount of one or more of the 
compounds described above, formulated together with one or more pharmaceutically 
acceptable carriers (additives) and/or diluents. As described in detail below, the 
pharmaceutical compositions of the present invention may be specially formulated for 
administration in solid or liquid form, including those adapted for the following: (1) oral 
administration, for example, drenches (aqueous or non-aqueous solutions or 
suspensions), tablets, boluses, powders, granules, pastes for application to the tongue; (2) 
parenteral administration, for example, by subcutaneous, intramuscular or intravenous
injection as, for example, a sterile solution or suspension; (3) topical application, for 
example, as a cream, ointment or spray applied to the skin; or (4) intravaginally or 
intrarectally, for example, as a pessary, cream or foam. 

The phrase "therapeutically-effective amount" as used herein means that amount 
of a compound, material, or composition comprising a deacetylase inhibitor of the present 
invention which is effective for producing some desired therapeutic effect by inhibiting 
histone deacetylation in at least a sub-population of cells in an animal and thereby 
blocking the biological consequences of that event in the treated cells, at a reasonable 
benefit/risk ratio applicable to any medical treatment. 

The phrase "pharmaceutically acceptable" is employed herein to refer to those 
compounds, materials, compositions, and/or dosage forms which are, within the scope of 
sound medical judgment, suitable for use in contact with the tissues of human beings and 
animals without excessive toxicity, irritation, allergic response, or other problem or 
complication, commensurate with a reasonable benefit/risk ratio. 

The phrase "pharmaceutically-acceptable carrier" as used herein means a 
pharmaceutically-acceptable material, composition or vehicle, such as a liquid or solid
filler, diluent, excipient, solvent or encapsulating material, involved in carrying or
transporting the subject deacetylase inhibitor agent from one organ, or portion of the 
body, to another organ, or portion of the body. Each carrier must be "acceptable" in the 
sense of being compatible with the other ingredients of the formulation and not injurious 
to the patient. Some examples of materials which can serve as pharmaceutically- 
acceptable carriers include: (1) sugars, such as lactose, glucose and sucrose; (2) starches, 
such as corn starch and potato starch; (3) cellulose, and its derivatives, such as sodium
carboxymethyl cellulose, ethyl cellulose and cellulose acetate; (4) powdered tragacanth;
(5) malt; (6) gelatin; (7) talc; (8) excipients, such as cocoa butter and suppository waxes;
(9) oils, such as peanut oil, cottonseed oil, safflower oil, sesame oil, olive oil, corn oil and
soybean oil; (10) glycols, such as propylene glycol; (11) polyols, such as glycerin, 
sorbitol, mannitol and polyethylene glycol; (12) esters, such as ethyl oleate and ethyl 
laurate; (13) agar; (14) buffering agents, such as magnesium hydroxide and aluminum 
hydroxide; (15) alginic acid; (16) pyrogen-free water; (17) isotonic saline; (18) Ringer's 
solution; (19) ethyl alcohol; (20) phosphate buffer solutions; and (21) other non-toxic 
compatible substances employed in pharmaceutical formulations.

As set out above, certain embodiments of the present deacetylase inhibitors may 
contain a basic functional group, such as amino or alkylamino, and are, thus, capable of 
forming pharmaceutically-acceptable salts with pharmaceutically-acceptable acids. The
term "pharmaceutically-acceptable salts" in this respect, refers to the relatively non-toxic, 
inorganic and organic acid addition salts of compounds of the present invention. These 
salts can be prepared in situ during the final isolation and purification of the compounds 
of the invention, or by separately reacting a purified compound of the invention in its free 
base form with a suitable organic or inorganic acid, and isolating the salt thus formed. 
Representative salts include the hydrobromide, hydrochloride, sulfate, bisulfate,
phosphate, nitrate, acetate, valerate, oleate, palmitate, stearate, laurate, benzoate, lactate,
phosphate, tosylate, citrate, maleate, fumarate, succinate, tartrate, napthylate, mesylate,
glucoheptonate, lactobionate, and laurylsulphonate salts and the like. (See, for example,
Berge et al. (1977) "Pharmaceutical Salts", J. Pharm. Sci. 66:1-19).

In other cases, the deacetylase inhibitory compounds of the present invention may 
contain one or more acidic functional groups and, thus, are capable of forming 
pharmaceutically-acceptable salts with pharmaceutically-acceptable bases. The term 
"pharmaceutically-acceptable salts" in these instances refers to the relatively non-toxic, 
inorganic and organic base addition salts of compounds of the present invention. These 
salts can likewise be prepared in situ during the final isolation and purification of the
compounds, or by separately reacting the purified compound in its free acid form with a
suitable base, such as the hydroxide, carbonate or bicarbonate of a pharmaceutically- 
acceptable metal cation, with ammonia, or with a pharmaceutically-acceptable organic 
primary, secondary or tertiary amine. Representative alkali or alkaline earth salts include 
the lithium, sodium, potassium, calcium, magnesium, and aluminum salts and the like.
Representative organic amines useful for the formation of base addition salts include
ethylamine, diethylamine, ethylenediamine, ethanolamine, diethanolamine, piperazine and
the like. (See, for example, Berge et al., supra) 

Wetting agents, emulsifiers and lubricants, such as sodium lauryl sulfate and 
magnesium stearate, as well as coloring agents, release agents, coating agents, 
sweetening, flavoring and perfuming agents, preservatives and antioxidants can also be 
present in the compositions. 

Examples of pharmaceutically-acceptable antioxidants include: (1) water soluble 
antioxidants, such as ascorbic acid, cysteine hydrochloride, sodium bisulfate, sodium 
metabisulfite, sodium sulfite and the like; (2) oil-soluble antioxidants, such as ascorbyl 
palmitate, butylated hydroxyanisole (BHA), butylated hydroxytoluene (BHT), lecithin, 
propyl gallate, alpha-tocopherol, and the like; and (3) metal chelating agents, such as
citric acid, ethylenediamine tetraacetic acid (EDTA), sorbitol, tartaric acid, phosphoric 
acid, and the like. 

Formulations of the present invention include those suitable for oral, nasal, topical 
(including buccal and sublingual), rectal, vaginal and/or parenteral administration. The 
formulations may conveniently be presented in unit dosage form and may be prepared by
any methods well known in the art of pharmacy. The amount of active ingredient which
can be combined with a carrier material to produce a single dosage form will vary
depending upon the host being treated and the particular mode of administration. The
amount of active ingredient which can be combined with a carrier material to produce a
single dosage form will generally be that amount of the deacetylase inhibitor which
produces a therapeutic effect. Generally, out of one hundred per cent, this amount will 
range from about 1 per cent to about ninety-nine percent of active ingredient, preferably 
from about 5 per cent to about 70 per cent, most preferably from about 10 per cent to 
about 30 per cent. 
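
The percentage ranges above translate directly into amounts of active ingredient and carrier per unit dose. The Python sketch below is a hedged illustration only: it assumes the percentages are by weight, treats the "about" bounds as exact for simplicity, and uses a hypothetical 500 mg unit dose that does not come from the specification.

```python
def split_unit_dose(total_mg, active_percent):
    """Return (active_mg, carrier_mg) for a unit dose at a given weight percent."""
    if not 1.0 <= active_percent <= 99.0:
        raise ValueError("active ingredient must be between 1 and 99 per cent")
    active_mg = total_mg * active_percent / 100.0
    return active_mg, total_mg - active_mg


def preference_tier(active_percent):
    """Classify a percentage against the ranges recited in the text."""
    if 10.0 <= active_percent <= 30.0:
        return "most preferred"   # about 10 to about 30 per cent
    if 5.0 <= active_percent <= 70.0:
        return "preferred"        # about 5 to about 70 per cent
    if 1.0 <= active_percent <= 99.0:
        return "general"          # about 1 to about 99 per cent
    return "outside recited range"


# Hypothetical example: a 500 mg unit dose containing 20 per cent active ingredient.
active_mg, carrier_mg = split_unit_dose(500.0, 20.0)
print(active_mg, carrier_mg, preference_tier(20.0))
```

The tiers are nested, so a value is reported under the narrowest range it satisfies.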

Methods of preparing these formulations or compositions include the step of
bringing into association a compound of the present invention with the carrier and,
optionally, one or more accessory ingredients. In general, the formulations are prepared
by uniformly and intimately bringing into association a deacetylase inhibitor of the present
invention with liquid carriers, or finely divided solid carriers, or both, and then, if
necessary, shaping the product. 

Formulations of the invention suitable for oral administration may be in the form 
of capsules, cachets, pills, tablets, lozenges (using a flavored basis, usually sucrose and 
acacia or tragacanth), powders, granules, or as a solution or a suspension in an aqueous 
or non-aqueous liquid, or as an oil-in-water or water-in-oil liquid emulsion, or as an elixir 
or syrup, or as pastilles (using an inert base, such as gelatin and glycerin, or sucrose and 
acacia) and/or as mouth washes and the like, each containing a predetermined amount of
a compound of the present invention as an active ingredient. A deacetylase inhibitor of 
the present invention may also be administered as a bolus, electuary or paste. 

In solid dosage forms of the invention for oral administration (capsules, tablets,
pills, dragees, powders, granules and the like), the active ingredient is mixed with one or 
more pharmaceutically-acceptable carriers, such as sodium citrate or dicalcium 
phosphate, and/or any of the following: (1) fillers or extenders, such as starches, lactose, 
sucrose, glucose, mannitol, and/or silicic acid; (2) binders, such as, for example,
carboxymethylcellulose, alginates, gelatin, polyvinyl pyrrolidone, sucrose and/or acacia;
(3) humectants, such as glycerol; (4) disintegrating agents, such as agar-agar, calcium
carbonate, potato or tapioca starch, alginic acid, certain silicates, and sodium carbonate;
(5) solution retarding agents, such as paraffin; (6) absorption accelerators, such as
quaternary ammonium compounds; (7) wetting agents, such as, for example, cetyl alcohol
and glycerol monostearate; (8) absorbents, such as kaolin and bentonite clay; (9)
lubricants, such as talc, calcium stearate, magnesium stearate, solid polyethylene glycols,
sodium lauryl sulfate, and mixtures thereof; and (10) coloring agents. In the case of
capsules, tablets and pills, the pharmaceutical compositions may also comprise buffering 
agents. Solid compositions of a similar type may also be employed as fillers in soft and 
hard-filled gelatin capsules using such excipients as lactose or milk sugars, as well as high 
molecular weight polyethylene glycols and the like. 

A tablet may be made by compression or molding, optionally with one or more 
accessory ingredients. Compressed tablets may be prepared using binder (for example, 
gelatin or hydroxypropylmethyl cellulose), lubricant, inert diluent, preservative,
disintegrant (for example, sodium starch glycolate or cross-linked sodium carboxymethyl 
cellulose), surface-active or dispersing agent. Molded tablets may be made by molding in 
a suitable machine a mixture of the powdered deacetylase inhibitor moistened with an 
inert liquid diluent.

The tablets, and other solid dosage forms of the pharmaceutical compositions of
the present invention, such as dragees, capsules, pills and granules, may optionally be 
scored or prepared with coatings and shells, such as enteric coatings and other coatings 
well known in the pharmaceutical-formulating art. They may also be formulated so as to
provide slow or controlled release of the active ingredient therein using, for example,
hydroxypropylmethyl cellulose in varying proportions to provide the desired release
profile, other polymer matrices, liposomes and/or microspheres. They may be sterilized
by, for example, filtration through a bacteria-retaining filter, or by incorporating
sterilizing agents in the form of sterile solid compositions which can be dissolved in sterile
water, or some other sterile injectable medium immediately before use. These
compositions may also optionally contain opacifying agents and may be of a composition 
that they release the active ingredient(s) only, or preferentially, in a certain portion of the 
gastrointestinal tract, optionally, in a delayed manner. Examples of embedding
compositions which can be used include polymeric substances and waxes. The active
ingredient can also be in micro-encapsulated form, if appropriate, with one or more of the
above-described excipients.

Liquid dosage forms for oral administration of the deacetylase inhibitors of the 
invention include pharmaceutically acceptable emulsions, microemulsions, solutions,
suspensions, syrups and elixirs. In addition to the active ingredient, the liquid dosage
forms may contain inert diluents commonly used in the art, such as, for example, water or
other solvents, solubilizing agents and emulsifiers, such as ethyl alcohol, isopropyl
alcohol, ethyl carbonate, ethyl acetate, benzyl alcohol, benzyl benzoate, propylene glycol,
1,3-butylene glycol, oils (in particular, cottonseed, groundnut, corn, germ, olive, castor
and sesame oils), glycerol, tetrahydrofuryl alcohol, polyethylene glycols and fatty acid
esters of sorbitan, and mixtures thereof.

Besides inert diluents, the oral compositions can also include adjuvants such as 
wetting agents, emulsifying and suspending agents, sweetening, flavoring, coloring,
perfuming and preservative agents.

Suspensions, in addition to the active deacetylase inhibitor, may contain 
suspending agents as, for example, ethoxylated isostearyl alcohols, polyoxyethylene
sorbitol and sorbitan esters, microcrystalline cellulose, aluminum metahydroxide,
bentonite, agar-agar and tragacanth, and mixtures thereof.

Formulations of the pharmaceutical compositions of the invention for rectal or 
vaginal administration may be presented as a suppository, which may be prepared by
mixing one or more compounds of the invention with one or more suitable nonirritating
excipients or carriers comprising, for example, cocoa butter, polyethylene glycol, a
suppository wax or a salicylate, and which is solid at room temperature, but liquid at
body temperature and, therefore, will melt in the rectum or vaginal cavity and release the
active deacetylase inhibitor. 

Formulations of the present invention which are suitable for vaginal administration 
also include pessaries, tampons, creams, gels, pastes, foams or spray formulations 
containing such carriers as are known in the art to be appropriate. 

Dosage forms for the topical or transdermal administration of a deacetylase 
inhibitor of this invention include powders, sprays, ointments, pastes, creams, lotions,
gels, solutions, patches and inhalants. The active compound may be mixed under sterile 
conditions with a pharmaceutically-acceptable carrier, and with any preservatives, buffers, 
or propellants which may be required. 

The ointments, pastes, creams and gels may contain, in addition to an active
deacetylase inhibitor of this invention, excipients, such as animal and vegetable fats, oils,
waxes, paraffins, starch, tragacanth, cellulose derivatives, polyethylene glycols, silicones,
bentonites, silicic acid, talc and zinc oxide, or mixtures thereof.

Powders and sprays can contain, in addition to a compound of this invention,
excipients such as lactose, talc, silicic acid, aluminum hydroxide, calcium silicates and
polyamide powder, or mixtures of these substances. Sprays can additionally contain
customary propellants, such as chlorofluorohydrocarbons and volatile unsubstituted
hydrocarbons, such as butane and propane. 

Transdermal patches have the added advantage of providing controlled delivery of
a compound of the present invention to the body. Such dosage forms can be made by 
dissolving or dispersing the deacetylase inhibitor in the proper medium. Absorption 
enhancers can also be used to increase the flux of the deacetylase inhibitor across the 
skin. The rate of such flux can be controlled by either providing a rate controlling
membrane or dispersing the deacetylase inhibitor in a polymer matrix or gel. 

Ophthalmic formulations, eye ointments, powders, solutions and the like, are also 
contemplated as being within the scope of this invention.

Pharmaceutical compositions of this invention, suitable for parenteral 
administration comprise one or more deacetylase inhibitors of the invention in
combination with one or more pharmaceutically-acceptable sterile isotonic aqueous or
nonaqueous solutions, dispersions, suspensions or emulsions, or sterile powders which
may be reconstituted into sterile injectable solutions or dispersions just prior to use,
which may contain antioxidants, buffers, bacteriostats, solutes which render the 
formulation isotonic with the blood of the intended recipient or suspending or thickening 
agents. 

Examples of suitable aqueous and nonaqueous carriers which may be employed in
the pharmaceutical compositions of the invention include water, ethanol, polyols (such as 
glycerol, propylene glycol, polyethylene glycol, and the like), and suitable mixtures 
thereof, vegetable oils, such as olive oil, and injectable organic esters, such as ethyl 
oleate. Proper fluidity can be maintained, for example, by the use of coating materials, 
such as lecithin, by the maintenance of the required particle size in the case of dispersions,
and by the use of surfactants. 

These compositions may also contain adjuvants such as preservatives, wetting 
agents, emulsifying agents and dispersing agents. Prevention of the action of 
microorganisms may be ensured by the inclusion of various antibacterial and antifungal 
agents, for example, paraben, chlorobutanol, phenol, sorbic acid, and the like. It may also
be desirable to include isotonic agents, such as sugars, sodium chloride, and the like into 
the compositions. In addition, prolonged absorption of the injectable pharmaceutical form 
may be brought about by the inclusion of agents which delay absorption such as 
aluminum monostearate and gelatin. 

In some cases, in order to prolong the effect of a drug, it is desirable to slow the 
absorption of the drug from subcutaneous or intramuscular injection. This may be 
accomplished by the use of a liquid suspension of crystalline or amorphous material 
having poor water solubility. The rate of absorption of the drug then depends upon its 
rate of dissolution which, in turn, may depend upon crystal size and crystalline form. 
Alternatively, delayed absorption of a parenterally-administered drug form is 
accomplished by dissolving or suspending the drug in an oil vehicle. 

Injectable depot forms are made by forming microencapsule matrices of the 
subject deacetylase inhibitors in biodegradable polymers such as polylactide- 
polyglycolide. Depending on the ratio of drug to polymer, and the nature of the particular 
polymer employed, the rate of drug release can be controlled. Examples of other 
biodegradable polymers include poly(orthoesters) and poly(anhydrides). Depot injectable 
formulations are also prepared by entrapping the drug in liposomes or microemulsions
which are compatible with body tissue.

When the compounds of the present invention are administered as
pharmaceuticals, to humans and animals, they can be given per se or as a pharmaceutical
composition containing, for example, 0.1 to 99.5% (more preferably, 0.5 to 90%) of
active ingredient in combination with a pharmaceutically acceptable carrier.

The preparations of the present invention may be given orally, parenterally,
topically, or rectally. They are of course given by forms suitable for each administration 
route. For example, they are administered in tablets or capsule form, by injection, 
inhalation, eye lotion, ointment, suppository, etc.; administration by injection, infusion or
inhalation; topical by lotion or ointment; and rectal by suppositories. Oral administration
is preferred.

These deacetylase inhibitors may be administered to humans and other animals for
therapy by any suitable route of administration, including orally, nasally, as by, for
example, a spray, rectally, intravaginally, parenterally, intracisternally and topically, as by
powders, ointments or drops, including buccally and sublingually. 

Regardless of the route of administration selected, the compounds of the present
invention, which may be used in a suitable hydrated form, and/or the pharmaceutical 
compositions of the present invention, are formulated into pharmaceutically-acceptable 
dosage forms by conventional methods known to those of skill in the art. 

Actual dosage levels of the active ingredients in the pharmaceutical compositions 
of this invention may be varied so as to obtain an amount of the active ingredient which is
effective to achieve the desired therapeutic response for a particular patient, composition, 
and mode of administration, without being toxic to the patient. 

The selected dosage level will depend upon a variety of factors including the 
activity of the particular deacetylase inhibitor employed, or the ester, salt or amide 
thereof, the route of administration, the time of administration, the rate of excretion of
the particular compound being employed, the duration of the treatment, other drugs, 
compounds and/or materials used in combination with the particular deacetylase inhibitor 
employed, the age, sex, weight, condition, general health and prior medical history of the 
patient being treated, and like factors well known in the medical arts. 

A physician or veterinarian having ordinary skill in the art can readily determine 
and prescribe the effective amount of the pharmaceutical composition required. For 
example, the physician or veterinarian could start doses of the compounds of the 
invention employed in the pharmaceutical composition at levels lower than that required
in order to achieve the desired therapeutic effect and gradually increase the dosage until
the desired effect is achieved. 

Another aspect of the present invention relates to a method of inducing and/or 
maintaining a differentiated state, enhancing survival, and/or inhibiting (or alternatively
potentiating) proliferation of a cell, by contacting the cells with an agent which modulates
HDx-dependent transcription. For instance, it is contemplated by the invention that, in
light of the present finding of an apparently broad involvement of HDx proteins in the
control of chromatin structure and, thus, transcription and replication, the subject method
could be used to generate and/or maintain an array of different tissue both in vivo and
in vitro.
An "HDx therapeutic," whether inhibitory or potentiating with respect to
HDx activity, can be, as appropriate, any of the preparations
described above, including isolated polypeptides, gene therapy constructs, antisense
molecules, peptidomimetics or agents identified in the drug assays provided herein.

The compounds of the present invention are likely to play an important role
in the modulation of cellular proliferation. There are a wide variety of pathological cell
proliferative conditions for which HDx therapeutics of the present invention may be used
in treatment. For instance, such agents can provide therapeutic benefits where the
general strategy is the inhibition of an anomalous cell proliferation. Diseases that
might benefit from this methodology include, but are not limited to, various cancers and
leukemias, psoriasis, bone diseases, fibroproliferative disorders such as involving
connective tissues, atherosclerosis and other smooth muscle proliferative disorders, as
well as chronic inflammation.

In addition to proliferative disorders, the present invention contemplates the use
of HDx therapeutics for the treatment of differentiative disorders which result from, for 
example, de-differentiation of tissue which may (optionally) be accompanied by abortive 
reentry into mitosis, e.g., apoptosis. Such degenerative disorders include chronic
neurodegenerative diseases of the nervous system, including Alzheimer's disease,
Parkinson's disease, Huntington's chorea, amyotrophic lateral sclerosis and the like, as
well as spinocerebellar degenerations. Other differentiative disorders include, for
example, disorders associated with connective tissue, such as may occur due to de- 
differentiation of chondrocytes or osteocytes, as well as vascular disorders which involve 
de-differentiation of endothelial tissue and smooth muscle cells, gastric ulcers
characterised by degenerative changes in glandular cells, and renal conditions marked by
failure to differentiate, e.g., Wilms' tumors.

It will also be apparent that, by transient use of modulators of HDx activities, in
vivo reformation of tissue can be accomplished, e.g. in the development and maintenance 
of organs. By controlling the proliferative and differentiative potential for different cells, 
the subject HDx therapeutics can be used to reform injured tissue, or to improve grafting 
and morphology of transplanted tissue. For instance, HDx antagonists and agonists can
be employed in a differential manner to regulate different stages of organ repair after 
physical, chemical or pathological insult. For example, such regimens can be utilized in 
repair of cartilage, increasing bone density, liver repair subsequent to a partial 
hepatectomy, or to promote regeneration of lung tissue in the treatment of emphysema. 
The present method is also applicable to cell culture techniques.

In one embodiment, the HDx therapeutic of the present invention can be used to 
induce differentiation of uncommitted progenitor cells and thereby give rise to a 
committed progenitor cell, or to cause further restriction of the developmental fate of a 
committed progenitor cell towards becoming a terminally-differentiated cell. For 
example, the present method can be used in vitro or in vivo to induce and/or maintain the
differentiation of hematopoietic cells into erythrocytes and other cells of the 
hematopoietic system. In an illustrative embodiment, the effect of erythropoietin (EPO)
on the growth of EPO-responsive erythroid precursor cells is increased to influence their 
differentiation into red blood cells. For example, as a result of administering an inhibitor 
of histone deacetylation, the amount of EPO, or other differentiating agent, required for
growth and/or differentiation is reduced (PCT/US92/07737). Accordingly, the HDx 
therapeutics of the present invention, particularly those which antagonize HDx 
deacetylase activity, can be administered alone or in conjunction with EPO and in a 
suitable carrier to vertebrates to promote erythropoiesis. Alternatively, cells could be 
treated ex vivo. Such treatment is contemplated in the treatment of a variety of disease 
states, including in individuals who require bone marrow transplants (e.g. patients with 
aplastic anemia, acute leukemias, recurrent lymphomas, or solid tumors). 

To illustrate, prior to receiving a bone marrow transplant, a recipient is prepared 
by ablating or removing endogenous hematopoietic stem cells. Such treatment is usually 
carried out by total body irradiation or delivery of a high dose of an alkylating agent or 
other chemotherapeutic, cytotoxic agent (Anklesaria et al. (1987) PNAS 84:7681-7685).
Following preparation of the recipient, donor bone marrow cells are injected 
intravenously. Optionally, the HDx therapeutics of the present invention could be 
contacted with the cells ex vivo or administered to the subject with the reimplanted cells.

It is also contemplated that there may be cell-type specific HDx proteins, and/or
that some cell types may be more sensitive to modulation of HDx deacetylase activities. 
Even within a cell type, the stage of differentiation or position in the cell cycle could 
influence their response to an HDx therapeutic. Accordingly, the present invention 
contemplates the use of agents which modulate histone deacetylase activity to specifically
inhibit or activate certain cell types. In an illustrative example, T cell proliferation could 
be preferentially inhibited in order to induce tolerance by using a procedure similar to that 
for inducing tolerance using sodium butyrate (see, for example, PCT/US93/03045). To 
illustrate, the HDx therapeutics of the present invention may be used to induce antigen-
specific tolerance in any situation in which it is desirable to induce tolerance, such as
autoimmune diseases, in allogeneic or xenogeneic transplant recipients, or in graft versus 
host (GVH) reactions. According to the invention, tolerance will typically be induced by 
presenting the tolerizing compound (e.g., an HDx inhibitor) substantially 
contemporaneously with the antigen, i.e. reasonably close together in time with the 
antigen. In preferred embodiments the HDx therapeutic will be administered after
presentation of the antigen, so that they will have their effect after the particular 
repertoire of Th cells begins to undergo clonal expansion. 

Yet another aspect of the present invention concerns the application of HDx
therapeutics to modulating morphogenic signals involved in organogenic pathways. 
Thus, it is contemplated by the invention that compositions comprising HDx therapeutics 
can also be utilized for both cell culture and therapeutic methods involving generation 
and maintenance of tissue. 

In a further embodiment of the invention, the subject HDx therapeutics will be
useful in increasing the amount of protein produced by a cell or recombinant cell. The
cell may include any primary cell isolated from any animal, cultured cells, immortalized
cells, and established cell lines. The animal cells used in the present invention include 
cells which intrinsically have an ability to produce a desired protein; cells which are 
induced to have an ability to produce a desired protein, for example, by stimulation with 
a cytokine such as an interferon, an interieukin; genetically engineered cells into which a 

30 gene for a desired protein is introduced. The protein produced by the process could 
mclude any peptides or proteins, including peptide hormone or proteinaceous hormones 
such as any usefiil hormone, cytokine, interieukin, or protein which it may be desirable to 
have in purified form and/or in large quantity. 

Another aspect of the invention features transgenic non-human animals which 
express a heterologous HDx gene of the present invention, or which have had one or
more genomic HDx genes disrupted in at least one of the tissue or cell-types of the
animal. Accordingly, the invention features an animal model for developmental diseases,
which animal has one or more HDx alleles which are mis-expressed. For example, a mouse
can be bred which has one or more HDx alleles deleted or otherwise rendered inactive.
Such a mouse model can then be used to study disorders arising from mis-expressed HDx 
genes, as well as for evaluating potential therapies for similar disorders. 

Another aspect of the present invention concerns transgenic animals which are 
comprised of cells (of that animal) which contain a transgene of the present invention and 
which preferably (though optionally) express an exogenous HDx protein in one or more 
cells in the animal. An HDx transgene can encode the wild-type form of the protein, or 
can encode homologs thereof, including both agonists and antagonists, as well as 
antisense constructs. In preferred embodiments, the expression of the transgene is 
restricted to specific subsets of cells, tissues or developmental stages utilizing, for 
example, cis-acting sequences that control expression in the desired pattern. In the 
present invention, such mosaic expression of an HDx protein can be essential for many 
forms of lineage analysis and can additionally provide a means to assess the effects of, for
example, lack of HDx expression which might grossly alter development in small patches
of tissue within an otherwise normal embryo. Toward this end, tissue-specific regulatory
sequences and conditional regulatory sequences can be used to control expression of the 
transgene in certain spatial patterns. Moreover, temporal patterns of expression can be 
provided by, for example, conditional recombination systems or prokaryotic 
transcriptional regulatory sequences. 

Genetic techniques which allow the expression of transgenes to be regulated
via site-specific genetic manipulation in vivo are known to those skilled in the art. For
instance, genetic systems are available which allow for the regulated expression of a
recombinase that catalyzes the genetic recombination of a target sequence. As used herein,
the phrase "target sequence" refers to a nucleotide sequence that is genetically 
recombined by a recombinase. The target sequence is flanked by recombinase 
recognition sequences and is generally either excised or inverted in cells expressing 
recombinase activity. Recombinase catalyzed recombination events can be designed such 
that recombination of the target sequence results in either the activation or repression of 
expression of one of the subject HDx proteins. For example, excision of a target 
sequence which interferes with the expression of a recombinant HDx gene, such as one 
which encodes an antagonistic homolog or an antisense transcript, can be designed to 
activate expression of that gene. This interference with expression of the protein can
result from a variety of mechanisms, such as spatial separation of the HDx gene from the
promoter element or an internal stop codon. Moreover, the transgene can be made
wherein the coding sequence of the gene is flanked by recombinase recognition sequences
and is initially transfected into cells in a 3' to 5' orientation with respect to the promoter
element. In such an instance, inversion of the target sequence will reorient the subject
gene by placing the 5' end of the coding sequence in an orientation with respect to the
promoter element which allows for promoter driven transcriptional activation.

In an illustrative embodiment, either the cre/loxP recombinase system of
bacteriophage P1 (Lakso et al. (1992) PNAS 89:6232-6236; Orban et al. (1992) PNAS
89:6861-6865) or the FLP recombinase system of Saccharomyces cerevisiae (O'Gorman
et al. (1991) Science 251:1351-1355; PCT publication WO 92/15694) can be used to
generate in vivo site-specific genetic recombination systems. Cre recombinase catalyzes
the site-specific recombination of an intervening target sequence located between loxP
sequences. loxP sequences are 34 base pair nucleotide repeat sequences to which the Cre
recombinase binds and are required for Cre recombinase mediated genetic recombination.
The orientation of loxP sequences determines whether the intervening target sequence is
excised or inverted when Cre recombinase is present (Abremski et al. (1984) J. Biol.
Chem. 259:1509-1514): Cre catalyzes the excision of the target sequence when the loxP
sequences are oriented as direct repeats and catalyzes inversion of the target sequence
when loxP sequences are oriented as inverted repeats.
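
The dependence of the outcome on loxP orientation can be illustrated with a small
sketch. The Python below is a toy model only and does not describe any construct of
this application: a construct is written as an ordered list of named elements, each
with an orientation, and the element names ("promoter", "STOP", "HDx") and the
function cre_recombine are hypothetical. It assumes exactly two loxP sites and a
single recombination event.

```python
from typing import List, Tuple

# (name, orientation): +1 = forward, -1 = reverse. All names are illustrative.
Element = Tuple[str, int]


def cre_recombine(construct: List[Element]) -> List[Element]:
    """Apply one Cre-mediated event between the two loxP sites of a construct."""
    sites = [i for i, (name, _) in enumerate(construct) if name == "loxP"]
    if len(sites) != 2:
        raise ValueError("this sketch expects exactly two loxP sites")
    left, right = sites
    between = construct[left + 1:right]
    if construct[left][1] == construct[right][1]:
        # Direct repeats: the intervening segment is excised; one loxP remains.
        return construct[:left + 1] + construct[right + 1:]
    # Inverted repeats: the intervening segment is flipped in place.
    inverted = [(name, -orient) for name, orient in reversed(between)]
    return construct[:left + 1] + inverted + construct[right:]


# A blocking element between direct repeats is removed, joining promoter and gene.
stop_cassette = [("promoter", 1), ("loxP", 1), ("STOP", 1), ("loxP", 1), ("HDx", 1)]
# A gene placed backwards between inverted repeats is reoriented.
backwards_gene = [("promoter", 1), ("loxP", 1), ("HDx", -1), ("loxP", -1)]

print(cre_recombine(stop_cassette))
print(cre_recombine(backwards_gene))
```

In this model the first construct corresponds to activation by excision of an
interfering sequence, and the second to activation by inversion of a coding
sequence initially placed in the 3' to 5' orientation.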

Accordingly, genetic recombination of the target sequence is dependent on 
expression of the Cre recombinase. Expression of the recombinase can be regulated by 
promoter elements which are subject to regulatory control, e.g., tissue-specific, 
developmental stage-specific, inducible or repressible by externally added agents. This 
regulated control will result in genetic recombination of the target sequence only in cells 
where recombinase expression is mediated by the promoter element. Thus, the activation
of expression of a recombinant HDx protein can be regulated via control of recombinase
expression.

Use of the cre/loxP recombinase system to regulate expression of a recombinant 
HDx protein requires the construction of a transgenic animal containing transgenes 
encoding both the Cre recombinase and the subject protein. Animals containing both the 
Cre recombinase and a recombinant HDx gene can be provided through the construction 
of "double" transgenic animals. A convenient method for providing such animals is to 
mate two transgenic animals each containing a transgene, e.g., an HDx gene and a
recombinase gene.

One advantage of initially constructing transgenic animals containing
an HDx transgene in a recombinase-mediated expressible format derives from the
likelihood that the subject protein, whether agonistic or antagonistic, can be deleterious
upon expression in the transgenic animal. In such an instance, a founder population, in
which the subject transgene is silent in all tissues, can be propagated and maintained.
Individuals of this founder population can be crossed with animals expressing the
recombinase in, for example, one or more tissues and/or a desired temporal pattern.
Thus, the creation of a founder population in which, for example, an antagonistic HDx
transgene is silent will allow the study of progeny from that founder in which disruption
of HDx mediated induction in a particular tissue or at certain developmental stages would
result in, for example, a lethal phenotype.
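
As a rough guide to how many progeny of such a cross would carry both transgenes,
the expected genotype classes can be computed as below. This is a sketch under
stated assumptions that are not taken from this application: each transgene is
integrated at a single autosomal locus, the two loci segregate independently,
transmission is Mendelian, and one parent carries only the HDx transgene while the
other carries only the recombinase transgene. The function names are hypothetical.

```python
from fractions import Fraction
from typing import Dict


def transmission(copies: int) -> Fraction:
    """Probability that a gamete carries the transgene (0, 1 or 2 copies at one locus)."""
    if copies not in (0, 1, 2):
        raise ValueError("copies must be 0, 1 or 2")
    return Fraction(copies, 2)


def offspring_classes(hdx_copies: int, cre_copies: int) -> Dict[str, Fraction]:
    """Expected progeny fractions from an HDx-only parent x recombinase-only parent."""
    p_hdx = transmission(hdx_copies)  # from the HDx transgenic parent
    p_cre = transmission(cre_copies)  # from the recombinase transgenic parent
    return {
        "HDx and recombinase": p_hdx * p_cre,
        "HDx only": p_hdx * (1 - p_cre),
        "recombinase only": (1 - p_hdx) * p_cre,
        "neither": (1 - p_hdx) * (1 - p_cre),
    }


# Two hemizygous parents (one copy each).
for genotype, fraction in offspring_classes(1, 1).items():
    print(genotype, fraction)
```

Under these assumptions a cross of two hemizygous parents yields one quarter
double transgenic progeny, and breeding the HDx founder line to homozygosity
raises this to one half.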

Similar conditional transgenes can be provided using prokaryotic promoter 
sequences which require prokaryotic proteins to be simultaneously expressed in order to
facilitate expression of the HDx transgene. Exemplary promoters and the corresponding
trans-activating prokaryotic proteins are given in U.S. Patent No. 4,833,080.

Moreover, expression of the conditional transgenes can be induced by gene 
therapy-like methods wherein a gene encoding the trans-activating protein, e.g. a 
recombinase or a prokaryotic protein, is delivered to the tissue and caused to be 
expressed, such as in a cell-type specific manner. By this method, an HDx transgene 
could remain silent into adulthood until "turned on" by the introduction of the
trans-activator.

In an exemplary embodiment, the "transgenic non-human animals" of the 
invention are produced by introducing transgenes into the germline of the non-human 
animal. Embryonic target cells at various developmental stages can be used to introduce
transgenes. Different methods are used depending on the stage of development of the
embryonic target cell. The zygote is the best target for micro-injection. In the mouse, the
male pronucleus reaches the size of approximately 20 micrometers in diameter which
allows reproducible injection of 1-2 pl of DNA solution. The use of zygotes as a target for
gene transfer has a major advantage in that in most cases the injected DNA will be
incorporated into the host gene before the first cleavage (Brinster et al. (1985) PNAS
82:4438-4442). As a consequence, all cells of the transgenic non-human animal will carry
the incorporated transgene. This will in general also be reflected in the efficient
transmission of the transgene to offspring of the founder since 50% of the germ cells will
harbor the transgene. Microinjection of zygotes is the preferred method for incorporating
transgenes in practicing the invention.

Retroviral infection can also be used to introduce HDx transgenes into a
non-human animal. The developing non-human embryo can be cultured in vitro to the
blastocyst stage. During this time, the blastomeres can be targets for retroviral infection
(Jaenisch, R. (1976) PNAS 73:1260-1264). Efficient infection of the blastomeres is
obtained by enzymatic treatment to remove the zona pellucida (Manipulating the Mouse
Embryo, Hogan eds. (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1986)).
The viral vector system used to introduce the transgene is typically a replication-defective
retrovirus carrying the transgene (Jahner et al. (1985) PNAS 82:6927-6931; Van der
Putten et al. (1985) PNAS 82:6148-6152). Transfection is easily and efficiently obtained
by culturing the blastomeres on a monolayer of virus-producing cells (Van der Putten,
supra; Stewart et al. (1987) EMBO J. 6:383-388). Alternatively, infection can be
performed at a later stage. Virus or virus-producing cells can be injected into the 
blastocoele (Jahner et al. (1982) Nature 298:623-628). Most of the founders will be 
mosaic for the transgene since incorporation occurs only in a subset of the cells which 
formed the transgenic non-human animal. Further, the founder may contain various 
retroviral insertions of the transgene at different positions in the genome which generally 
will segregate in the offspring. In addition, it is also possible to introduce transgenes into 
the germ line by intrauterine retroviral infection of the midgestation embryo (Jahner et al. 
(1982) supra). 



A third type of target cell for transgene introduction is the embryonic stem cell 
(ES). ES cells are obtained from pre-implantation embryos cultured in vitro and fused
with embryos (Evans et al. (1981) Nature 292:154-156; Bradley et al. (1984) Nature
309:255-258; Gossler et al. (1986) PNAS 83:9065-9069; and Robertson et al. (1986)
Nature 322:445-448). Transgenes can be efficiently introduced into the ES cells by DNA
transfection or by retrovirus-mediated transduction. Such transformed ES cells can
thereafter be combined with blastocysts from a non-human animal. The ES cells 
thereafter colonize the embryo and contribute to the germ line of the resulting chimeric 
animal. For review see Jaenisch, R. (1988) Science 240: 1468-1474. 

Methods of making HDx knock-out or disruption transgenic animals are also 
generally known. See, for example, Manipulating the Mouse Embryo (Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Recombinase dependent
knockouts can also be generated, e.g. by homologous recombination to insert
recombinase target sequences flanking portions of an endogenous HDx gene, such that
inactivation of an HDx allele can be controlled in a tissue-specific and/or temporal
manner as above.

Exemplification

The invention, now being generally described, will be more readily understood by 
reference to the following examples, which are included merely for purposes of 
illustration of certain aspects and embodiments of the present invention and are not 
intended to limit the invention. 



Example 1 

Trapoxin is a microbially derived cyclotetrapeptide that inhibits histone
deacetylation in vivo and causes mammalian cells to arrest in the cell cycle. A trapoxin
affinity matrix was used to isolate two nuclear proteins that copurified with histone
deacetylase activity. Both proteins were identified by peptide microsequencing, and a
cDNA encoding the histone deacetylase catalytic subunit (HD1) was cloned from a
Jurkat T cell library. As the predicted protein is highly similar to the yeast transcriptional 
regulator RPD3, this study supports a role for histone deacetylase as a key regulator of 
eukaryotic transcription. 

A requirement for a functional histone deacetylase in cell cycle progression has 
been implicated by the discovery that two cytostatic agents, trapoxin and trichostatin 
(Figure 1A), inhibit histone deacetylation in cultured mammalian cells and in fractionated
cell extracts (4). In addition to causing G1 and G2 phase cell cycle arrest, these natural
products alter gene expression and induce certain mammalian cell lines to differentiate. 
Whereas sodium butyrate also has these properties, both trapoxin and trichostatin are five 
orders of magnitude more potent. 

Trapoxin is an "irreversible" inhibitor of histone deacetylase activity and its 
molecular structure offers clues as to how it could form a covalent bond with a
nucleophilic active site residue. First, trapoxin contains an electrophilic epoxyketone that
is essential for biological activity (5). Second, the aliphatic epoxyketone side chain is
approximately isosteric with N-acetyl lysine (Figure 1A). Trapoxin likely acts as a
substrate mimic, with the epoxyketone poised to alkylate an active site nucleophile. We
therefore regarded trapoxin as a tool that could reveal the molecular identity of histone 
deacetylase, so that its role in transcriptional regulation and cell cycle progression could 
be elucidated.

Tritium-labeled trapoxin was prepared by total synthesis and used to identify
trapoxin binding proteins in crude extracts from bovine thymus. We used a charcoal
precipitation assay to detect a specific trapoxin binding activity primarily in the nuclear
fraction of the extracts (6). The binding activity was saturable with nanomolar
concentrations of [3H]trapoxin and was competed by the simultaneous addition of
unlabeled trapoxin. Trichostatin also competed with [3H]trapoxin (for synthesis see
Example 2), suggesting that both of these compounds exert their cellular effects by 
targeting the same molecule. 

If trapoxin and trichostatin induce cell cycle arrest by directly inhibiting histone 
deacetylase, then the binding and enzymatic activities should copurify. To investigate 
this possibility, we fractionated nuclear thymus proteins by ammonium sulfate precipitation
and Mono Q anion exchange chromatography. 

Briefly, thymocytes (~12 g) prepared from fresh bovine thymus were
homogenized in hypotonic lysis buffer [20 mM tris (pH 7.8), 20 mM NaCl, 1 mM EDTA,
10% glycerol, 1 mM PMSF, 1 mM benzamidine, 10 µg/ml each of pepstatin, aprotinin,
and leupeptin] by mechanical disruption and the nuclei were isolated by centrifugation at
3000g. Nuclei were resuspended in lysis buffer and the proteins were extracted with 0.4
M ammonium sulfate. The viscous lysate was sonicated and clarified by centrifugation at
100,000g for one hour. Proteins were then precipitated with 90% saturated ammonium
sulfate and recovered by centrifugation (100,000g, one hour). After thorough dialysis
against Q buffer (25 mM tris pH 8, 10 mM NH4Cl, 0.25 mM EDTA, 10% glycerol), a
portion of the nuclear proteins (~12 mg total protein) was loaded onto a HR 10/10 Mono
Q column (Pharmacia). The column was washed with 25 ml Q buffer and eluted with a
50 ml linear gradient of 10 to 500 mM NH4Cl. The column was further washed with 25
ml 500 mM NH4Cl and 25 ml 1 M NH4Cl. Fractions were assayed for trapoxin binding and
histone deacetylase activities or further purified with the K-trap affinity matrix. All
procedures were done at 4°C.
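
For orientation, the nominal salt concentration at any point of the 50 ml linear
gradient can be worked out as in the sketch below. This is an idealized
calculation, not part of the procedure as performed: it assumes a perfectly linear
gradient of 10 to 500 mM NH4Cl and ignores column dead volume and mixing, and the
function names are hypothetical.

```python
def gradient_concentration(volume_ml: float, start_mm: float = 10.0,
                           end_mm: float = 500.0, gradient_ml: float = 50.0) -> float:
    """Nominal salt concentration (mM) after volume_ml of a linear gradient."""
    if not 0.0 <= volume_ml <= gradient_ml:
        raise ValueError("volume lies outside the gradient")
    return start_mm + (end_mm - start_mm) * volume_ml / gradient_ml


def volume_at_concentration(conc_mm: float, start_mm: float = 10.0,
                            end_mm: float = 500.0, gradient_ml: float = 50.0) -> float:
    """Gradient volume (ml) at which a nominal concentration (mM) is reached."""
    if not start_mm <= conc_mm <= end_mm:
        raise ValueError("concentration lies outside the gradient")
    return (conc_mm - start_mm) * gradient_ml / (end_mm - start_mm)


# Nominal concentration halfway through the gradient.
print(gradient_concentration(25.0))
# Gradient volumes bracketing a 250 to 350 mM elution window.
print(round(volume_at_concentration(250.0), 1), round(volume_at_concentration(350.0), 1))
```

On this idealized basis, material eluting between 250 and 350 mM NH4Cl would be
expected roughly between 24 and 35 ml into the gradient.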

Two peaks of histone deacetylase activity eluted from the Mono Q column 
between 250 and 350 mM NH4Cl (Figure 1B). Trapoxin binding activity, as revealed by
the charcoal precipitation assay (40 nM [3H]trapoxin), precisely coeluted with the histone
deacetylase peaks. Furthermore, all detectable histone deacetylase activity was abolished
by treatment with either trapoxin or trichostatin (20 nM). Similar results were obtained
with Mono Q fractionated nuclear extracts prepared from human Jurkat T cells.

To purify the histone deacetylase further, we synthesized an affinity matrix based 
on the trapoxin structure. Because trapoxin itself is not amenable to derivatization and
the epoxyketone side chain is indispensable for activity, we chose to replace one of the
phenylalanine residues of trapoxin's cyclic core with a lysine that could then be covalently
linked to a solid support. This molecule, which we call K-trap, was prepared by a twenty
step synthesis starting with commercially available (R)-proline and (S,S)-threitol
acetonide (Figure 2A) (see Example 3). Synthetic K-trap inhibited [3H]thymidine
incorporation in MG-63 human osteosarcoma cells with a potency approximately one 
tenth that of trapoxin. In vitro histone deacetylase activity was also inhibited potently by 
this compound (complete inactivation at 20 nM) (8). 

K-trap was deprotected with Pd(Ph3P)4 and coupled to an activated agarose 
matrix (Figure 2A). Mono Q fractions containing nuclear proteins from bovine thymus
were incubated with the K-trap affinity matrix and then tested for both trapoxin binding
and histone deacetylase activity. Both activities were depleted (90%) by treatment with
the K-trap matrix, yet a control matrix capped with ethanolamine had no effect on either
activity (8). Bound polypeptides were eluted by boiling the matrix in 1% SDS buffer and
separated by polyacrylamide gel electrophoresis. In vitro binding experiments with soluble
[3H]trapoxin indicated that the radiolabel is released into solution following protein
denaturation with SDS or guanidinium hydrochloride. Thus, trapoxin binding proteins
were expected to elute from the affinity matrix with SDS. 

The silver stained gel of the affinity matrix eluates revealed six major polypeptides
with apparent molecular sizes between 45 and 50 kD (Figure 2B). The interaction
between bovine p46-p50 and the K-trap matrix appeared to be specific, because these
proteins were not retained when the incubation was done in the presence of either
trapoxin or trichostatin (Figure 2B), nor were they retained by the control matrix. The
ability of a structurally unrelated histone deacetylase inhibitor, trichostatin, to prevent
p46-p50 from binding to the K-trap matrix
implies that one or more of these polypeptides constitute the biologically relevant protein
target of both trapoxin and trichostatin. When the affinity purification was repeated with
Jurkat nuclear extracts, only two major bands, p50 and p55, were observed by silver
staining (Figure 2B). Recovery of human p50 and p55 was similarly abolished by
trapoxin (Figure 2B) and trichostatin (8). Because the relative intensities of bovine
p46-p49 vary with each protein preparation, we suspect that they are proteolytic fragments
derived from the bovine equivalent of human p55. One of the bands (p50) is common to
both human and bovine sources.

Large scale purification of the bovine proteins led to the resolution of two major 
bands of ~46 and ~50 kD in the final preparative electrophoresis step, both of which
were submitted for microsequencing.

To obtain enough trapoxin binding protein for microsequencing, nuclear
ammonium sulfate pellets from 15 bovine thymuses were prepared as described above.
Sedimented proteins were resuspended in and dialyzed against buffer A [20 mM bistris
(pH 7.2), 20 mM NaCl, 10% glycerol] for 12 hours, and brought to pH 5.8 by dialyzing
against buffer A (pH 5.8) for 30 minutes. After centrifugation, the dialysate (~650 mg
protein) was loaded onto a Q Sepharose FF column (2.6 x 10 cm; Pharmacia) and the
column was washed with 120 ml buffer A (pH 5.8). Proteins were eluted with a 400 ml
linear gradient of 20 to 600 mM NaCl in buffer A. Fractions (10 ml; each fraction
contained 1 ml of 1 M tris pH 8 to neutralize the acidic buffer A) were assayed for
trapoxin binding activity. Tween-20 was added to active fractions at a final
concentration of 0.05%, and these fractions were incubated with K-trap affinity matrix
for 16 hours (25 µl per ml Q fraction). After washing the matrix three times with
phosphate buffered saline, bound proteins were eluted by boiling in 40 µl of SDS sample
buffer per 25 µl of matrix. SDS eluates were combined and the proteins resolved by
SDS-polyacrylamide gel electrophoresis and transferred to a PVDF membrane (Biorad).
Staining with Ponceau S revealed two major bands (46
and 50 kD). The excised bands were proteolytically digested and the HPLC purified
peptide fragments were sequenced at the Harvard Microchemistry Facility.

The bovine protein of larger molecular size (~50 kD) corresponds to a known
protein, RbAp48 (11), that consists of seven WD repeat domains (12). Originally 
identified as a protein that binds to the retinoblastoma gene product (pRb), RbAp48 may 
constitute an adaptor subunit that targets the histone deacetylase to specific chromatin 
domains. 

The ~46 kD bovine protein is highly related to the protein encoded by the yeast
RPD3 gene, which has been implicated by several genetic screens as a transcriptional
regulator, but whose biochemical function is unknown (13). Partial cDNA sequences for
the human gene were identified in the expressed sequence tag database (dbEST) and
were used to design polymerase chain reaction (PCR) primers. Briefly, after noting
sequence similarity between peptides derived from the purified bovine trapoxin binding
protein and yeast RPD3, we checked dbEST to see whether any partial sequences for the
human homologue had been reported. Two ESTs (Genbank accession numbers:
D31480 and F07807) were identified whose predicted translation products aligned with
high sequence similarity to NH2- and COOH-terminal regions of RPD3, respectively. PCR
primers were designed based on these tags and a one kilobase PCR product was obtained
from a Jurkat cDNA library (Stratagene). A 32P-labeled probe prepared by random
priming was used to screen the Jurkat library, and ten positive clones were isolated. One
of the clones was fully sequenced and found to contain a putative full-length open
reading frame (Figure 3A). The peptide sequences obtained from the purified bovine
protein align with 100% identity to sequences deduced from this coding region (Figure
3A, boxed residues). We call this human protein HD1 (for histone deacetylase), and its
predicted size of 55 kD agrees well with the estimated size of p55 isolated from Jurkat
nuclear extracts using the K-trap affinity matrix (Figure 2B). A dbEST search indicated
the existence of at least two other related human genes. 

To determine the relationship between the proteins from bovine thymus (p46- 
p50) and the proteins isolated from human Jurkat T cells (p50 and p55), an antiserum 
was generated against a peptide specified by the HD1 open reading frame (Figure 3A,
amino acids 319 to 334). Immunoblot analysis of the bovine proteins p46-p49 and the
human protein p55 showed that they all react with the antiserum, providing additional
evidence that these bands correspond to bovine and human HD1 (Figure 3B). A
monoclonal antibody that specifically recognizes RbAp48 was used to confirm the
identity of bovine and human p50. Importantly, neither HD1 nor RbAp48 was detected
when the affinity purification was done in the presence of trapoxin or trichostatin (Figure 
3B). 

We used affinity purified antibodies directed against a COOH-terminal peptide 
(amino acids 467 to 482) to immunoprecipitate HD1 from crude nuclear extracts. The
immunoprecipitates contained histone deacetylase activity that was inhibited by both
trapoxin and trichostatin (Figure 4A). Consistent with the idea that HD1 and RbAp48
form a complex in vivo, the two proteins coprecipitated with the anti-HD1 antibodies
(Figure 4B). Neither HD1, RbAp48, nor the associated histone deacetylase activity was
immunoprecipitated in the presence of the HD1 COOH-terminal peptide (Figure 4A and
4B) (15). HD1, like RbAp48 (11), is detected predominantly in the nucleus by
immunostaining with the aforementioned antibodies (8). Given that HD1 and RbAp48
are the major proteins eluted from the K-trap matrix (Figure 2B), it is likely that they
interact directly with one another. 

We extended the results obtained with the endogenous protein by expressing 
recombinant FLAG epitope tagged HD1 (HD1-F) in Jurkat T cells. Anti-FLAG
immunoprecipitates from cells transfected with pBJ5/HD1-F contained histone
deacetylase activity that was sensitive to both trapoxin and trichostatin (Figure 4C).
Histone deacetylase activity was not precipitated when the antibody was blocked with
excess FLAG peptide (15). Interestingly, endogenous RbAp48 did not coprecipitate with
overexpressed HD1-F (8), demonstrating that RbAp48 is not required for either histone
deacetylase or trapoxin binding activity. The result is consistent with the idea that
RbAp48 serves a targeting rather than an enzymatic function. Finally, lysates from cells
transfected with pBJ5/HD1-F were incubated with the K-trap affinity matrix in the
presence or absence of trapoxin and trichostatin. Protein immunoblot analysis
demonstrated an interaction between recombinant HD1-F and the K-trap affinity matrix
that was fully competed by nanomolar concentrations of trapoxin or trichostatin (Figure 

HD1 is 60% identical to the protein encoded by the yeast RPD3 gene, which was
isolated in four independent mutant suppressor screens designed to identify
transcriptional repressors (13, 16, 17, 18, 19). No biochemical function for the yeast
protein has previously been postulated. A negative regulator of the TRK2 gene, RPD3 is
necessary for the transcriptional repression of several genes whose expression is
regulated according to specific environmental conditions. Loss of RPD3 also leads to
decreased transcriptional activation of certain genes, but this effect may be indirect (13,
17). Although RPD3 has yet to be implicated in silencing at telomeres or the mating
loci, the fact that silencing is eliminated by point mutations in specific lysine residues near
the NH2-terminus of histones H3 and H4 suggests that lysine deacetylation may
contribute to the maintenance of silenced chromatin (20, 21, 22, 23). Indeed, silencing at
telomeres and the mating loci has been correlated with the presence of hypoacetylated
histones, and sir mutants which are defective in silencing show a corresponding increase
in the extent of histone acetylation at these loci (24). The SIR3 and SIR4 proteins have
been shown to interact with a bacterially expressed histone H4 NH2-terminal domain in
vitro (25), and it is possible that deacetylation of one or more lysine residues is required
for this interaction in vivo. Our results further support a role for histone deacetylase as a
transcriptional regulator and establish a biochemical connection to the genetic studies that
originally characterized RPD3.

How does inhibition of histone deacetylase in mammalian cells lead to G1 and G2
phase cell cycle arrest? One possibility is that specific cell cycle regulatory proteins such
as the cyclin dependent kinase inhibitors are transcriptionally upregulated in response to
histone deacetylase inactivation. Alternatively, cell cycle checkpoints may exist that
monitor histone acetylation or higher-order chromatin structure. It should now be
possible to study the regulation of histone deacetylase during the cell cycle, its substrate
specificity, and the mechanism by which it is targeted to specific regions of the genome.

Example 2



3H-Trapoxin was prepared from (S,S)-threitol acetonide (9) by total synthesis, as
outlined in Figures 8A-8C. 

As shown in Figure 8A, (S,S)-threitol acetonide (9) was monoprotected by
treatment with triisopropylsilylchloride (TIPSCl) and sodium hydride in tetrahydrofuran
(THF). The free alcohol was then subjected to Swern oxidation. Wittig reaction of the
resulting aldehyde gave compound 10 in good yield for the three steps. Compound 10
was then hydrogenated with deprotection of the primary alcohol, which was then
converted to the bromide 11 in excellent yield. Bromide 11 was converted to the
organocuprate and reacted with (S)-serine β-lactone to yield the benzyloxycarbonyl
(Cbz)-protected amino acid 12.

As shown in Figure 8B, 12 was coupled to tripeptide methyl ester 14, and the 
methyl ester was saponified. The amino acid was then cyclized and the silyl protecting 
group was removed to yield cyclotetrapeptide 18 in 51% yield. 

Cyclotetrapeptide 18 was tritiated, as shown in Figure 8C, by oxidation of the 
primary alcohol with the Dess-Martin reagent, and the aldehyde was reduced with 
tritiated sodium borohydride to provide tritiated 18, which was converted to 
[3H]Trapoxin B by tosylation of the primary alcohol, deprotection of the diol, epoxide 
ring closure, and oxidation of the secondary alcohol to yield the desired compound. 
Non-radiolabelled 18 was converted to [1H]Trapoxin B, via tosylate 19, in 68% overall
yield.



Example 3

K-Trap was prepared from (S,S)-threitol acetonide (9) by total synthesis, as 
outlined in Figures 9A-9C. As shown in Figure 9A, monoprotection and Swern
oxidation of 9 yielded the aldehyde as above. Wittig homologation yielded carboxylic
acid 20, which was converted to the mixed anhydride and treated with lithiated
oxazolidinone 21 to provide 22 in excellent yield. Deprotection of the primary alcohol
and conversion to the tosylate were followed by treatment of the potassium enolate with
trisyl azide according to the method of Evans to effect electrophilic azide transfer in good
overall yield and stereoselectivity, providing compound 23. Removal of the chiral 
auxiliary and catalytic reduction of the azido function, with hydrogenation of the olefin, 
provided amino acid 24, which was N-protected to give the Fmoc derivative 25 in high 
overall yield.

Referring to Figure 9B, protected amino acid 25 was coupled to tripeptide methyl
ester 26. The methyl ester was saponified to yield 27, which was cyclized under
high-dilution conditions to provide cyclotetrapeptide 28 in 58% yield.

As shown in Figure 9C, compound 28 was converted to K-trap (29) in good overall 
yield by deprotection of the diol, base-promoted epoxide closure, and oxidation of the 
secondary alcohol. The K-trap affinity matrix 30 was provided by palladium-catalyzed 
removal of the allyloxycarbonyl (Alloc) group from the lysine residue of 29, and 
immobilization on Affigel 10. 

Example 3 
Histone Deacetylase Activity is Required 
for Full Transcriptional Repression by mSin3A 



The Mad family of basic region-Helix-Loop-Helix-Leucine Zipper (BHLHZip) 
proteins plays an important role in controlling cell proliferation and differentiation (for 
reviews see: Amati and Land, 1994; Bernards 1995). Four identified Mad family 
members, Mad1, Mxi1, Mad3 and Mad4 (Ayer et al., 1993; Zervos et al., 1993; Hurlin et 
al., 1995a), form heterodimers with another BHLHZip protein, Max, to repress 
transcription (Ayer et al., 1993; Hurlin et al., 1995a) and are thought to play a negative 
role in the control of cell proliferation. 

Two mammalian homologs of the Saccharomyces cerevisiae transcriptional 
corepressor SIN3, mSin3A and mSin3B, have recently been identified as Mad interacting 
proteins and are required for Mad-mediated transcriptional repression (Ayer et al., 1995; 
Schreiber-Agus et al., 1995). The most conserved regions of these proteins correspond 
to four putative paired amphipathic helix (PAH) motifs, which have been proposed to 
constitute protein-protein interaction surfaces (Wang et al., 1990). The second PAH 
motif in mSin3A, mSin3B and Sin3p interacts with the mSin3 interaction domain or SID 
in the amino terminus of the four Mad family members (Ayer et al., 1995; Schreiber-Agus 
et al., 1995; Hurlin et al., 1995a; Kasten et al., 1996). Mad1, Max and mSin3A form 
ternary complexes capable of binding DNA (Ayer et al., 1995). Point mutations in the 
SID domain of Mad1 disrupt its ability to bind mSin3A, negate its function as a 
transcriptional repressor (Ayer et al., 1995), and eliminate Mad1 function in several 
biological assays (Koskinen et al., 1995; Roussel et al., 1996). These findings suggest 
that Mad:Max heterocomplexes repress transcription by tethering either mSin3A or 
mSin3B to DNA. A chimeric protein fusing the SID of Mad1 to the GAL4 DNA-binding 
domain results in repression of simple and complex promoters in a manner that is 
dependent on mSin3 binding, suggesting that targeting mSin3 to DNA is necessary for 
repression (Ayer et al., 1996). Nevertheless, the molecular mechanism(s) for 
mSin3A-mediated repression remain unknown. 

As described in Example 1, a mammalian histone deacetylase has been identified, 
and cDNAs encoding the protein, histone deacetylase 1 (HD1 or HDAC1 for the 
purposes of this example), have been cloned (see also Taunton et al., 1996b). HDAC1 is 
approximately 60% identical to the S. cerevisiae RPD3 protein, which is a component of 
a yeast histone deacetylase complex (Rundlett et al., 1996). Single mutations in either 
RPD3 or SIN3 give the same phenotypes as RPD3/SIN3 double mutants, suggesting that 
they function in the same pathway (Stillman et al., 1994). Because Mad family proteins 
use mSin3A as a corepressor and Mad1 can repress transcription in wild-type yeast but 
not yeast having a null mutation in SIN3 (Kasten et al., 1996) or RPD3 (D.J. Stillman, 
personal comm.), it is likely that the mechanism of transcriptional repression by Mad 
proteins is conserved between yeast and higher eukaryotes. Consistent with this 
hypothesis, the DNA-binding transcription factor YY1 interacts with a mammalian RPD3 
homolog to repress transcription of a heterologous reporter gene (Yang et al., 1996). 
These results demonstrate that mammalian RPD3-like activity functions in transcriptional 
regulation. 

Several lines of evidence suggest that the acetylation status of conserved lysines in 
the amino terminal domains of histones H3 and H4 plays a role in the regulation of 
transcription. In general, histone hyperacetylation correlates with transcriptionally active 
or poised genes; conversely, hypoacetylation correlates with transcriptionally repressed 
heterochromatin (for reviews see: Turner, 1993; Loidl, 1994; Wolffe, 1996). While little 
is known about the targeting and regulation of histone acetyltransferases and 
deacetylases, it has been recently shown that several transcriptional coactivators possess 
inherent acetyltransferase activity (Brownell et al., 1996; Ogryzko et al., 1996) or 
associate with acetyltransferases (Yang et al., 1996b). We report that mSin3A and 
HDAC1 associate in vivo and that the histone deacetylase inhibitor trapoxin interferes 
with mSin3A-mediated transcriptional repression. 

Results 

(i) mSin3A is present in cells as a large stable multiprotein complex. 

To study the in vivo function of mSin3A we generated polyclonal antiserum 
specific for the PAH2 domain of mSin3A. We tested this antiserum by 
immunoprecipitation using nuclear lysates made from the myeloid leukemia cell line U937 
that had been metabolically labeled with 35S-methionine. Analysis of immunoprecipitates 
showed an intensely labeled doublet with an apparent molecular weight of 150 
kiloDaltons that was present in the anti-mSin3A immunoprecipitates (Figure 10A). This 
doublet comigrated with in vitro translated mSin3A, shared identical V8 protease 
digestion peptides with in vitro translated mSin3A and was absent from 
immunoprecipitations using preimmune serum or immune serum preincubated with the 
cognate immunogen (data not shown). 

Fractionation of U937 nuclear extracts by size exclusion chromatography 
indicated that mSin3A is present in large molecular weight complex(es) (D.E.A., 
unpublished). To address this possibility, we performed immunoprecipitations from 
metabolically-labeled U937 cells under conditions that should preserve protein-protein 
interactions. In addition to mSin3A, the low-stringency mSin3A immunoprecipitates 
contained several labeled polypeptides of apparent molecular weight 250 kDa, 180 kDa, 
55 kDa, 50 kDa, 42 kDa, 33-36 kDa and 30 kDa (Figure 10A). These proteins were not 
detected in immunoprecipitates using mSin3A antiserum blocked with the cognate 
immunogen, suggesting that the proteins detected are specifically associated with 
mSin3A. Furthermore, none of these proteins were detected using high-stringency 
immunoprecipitation or by western blotting of whole-cell lysates using anti-Sin3A, 
suggesting that they do not share epitopes with mSin3A and are not proteolytic 
breakdown products of mSin3A (data not shown). All of the associated proteins appear 
to be present in substoichiometric amounts to mSin3A, suggesting that mSin3A 
complexes are heterogeneous. 

To test the stability of the mSin3A complex, we subjected low-stringency mSin3A 
immunoprecipitates to different salt concentrations and ionic detergent conditions. The 
proteins that remained bound to mSin3A in the immune complex were analyzed by 
SDS-PAGE. Under the most stringent conditions we observed only a slight loss of 
mSin3A-associated proteins in the immune complex (Figure 10B). One exception to this 
finding was the apparently quantitative loss of p42 under slightly elevated salt 
concentrations. These findings demonstrate that the mSin3A complex is stable in vivo and 
suggest that some or all of the mSin3A-associated proteins may facilitate mSin3A 
function as a transcriptional co-repressor. 



(ii) HDAC1 and RbAp48 are components of the mSin3A complex. 

Because SIN3 and RPD3 appear to function in the same pathway in yeast and two 
components of the mSin3A complex, p50 and p55, are similar in apparent molecular 
weight to HDAC1, we hypothesized that HDAC1 or related proteins might be 
components of the mSin3A repressor complex. To test this hypothesis, the proteins 
bound to mSin3A immune complexes were eluted with ionic detergents and reprecipitated 
with affinity purified antibodies specific for an internal peptide of HDAC1. Only two 
proteins eluted from the mSin3A complex were re-precipitated by HDAC1 antiserum 
(Figure 11A). These polypeptides comigrated with p50 and p55 from the low stringency 
mSin3A immunoprecipitation, suggesting that proteins highly related to HDAC1 are 
complexed to mSin3A in vivo. p55 comigrates with in vitro translated HDAC1 and is 
recognized by an antibody specific for a carboxy-terminal epitope unique to HDAC1. 
Another cDNA encoding an HDAC1 homolog, HDAC2, has recently been identified 
(Yang et al., 1996a). It is likely that p50 represents HDAC2 (data not shown). 

In a reciprocal experiment, we performed low stringency immunoprecipitations 
using antiserum specific for an epitope at the carboxy-terminus of HDAC1. HDAC1 
immunoprecipitates contain several proteins that were specifically competed with the 
immunizing peptide (Figure 11B). A polypeptide doublet that comigrated with mSin3A 
was detected in the HDAC1 immunocomplexes (Figure 11B and 11C). To confirm that 
the doublet coprecipitating with HDAC1 is mSin3A, the HDAC1 immunocomplex was 
eluted and reprecipitated with antiserum specific for mSin3A (Figure 11C). The two 
proteins in this precipitate comigrated with mSin3A, confirming that mSin3A and 
HDAC1 are associated in vivo. 

To determine whether HDAC1 associated with mSin3A in vivo is enzymatically 
active, we assayed low-stringency immunoprecipitates for histone deacetylase activity. 
We used a synthetic peptide corresponding to the first twenty-four amino acids of histone 
H4 as a substrate for our deacetylase assay (Taunton et al., 1996b). Low-stringency 
anti-mSin3A immunoprecipitates contained deacetylase activity; however, only background 
levels of deacetylase activity were detected in the immunoprecipitates if the mSin3A 
antiserum was blocked with cognate immunogen (Figure 11D). To confirm the authenticity 
of the mSin3A-associated activity we treated the immunoprecipitates with synthetic 
trapoxin, a specific inhibitor of histone deacetylase activity (Taunton et al., 1996a). 
Treatment of mSin3A complexes in vitro with 10 nM trapoxin reduced deacetylation by 
approximately 50% (Figure 11D), suggesting that the precipitated deacetylase activity 
can be attributed to trapoxin-sensitive histone deacetylases bound to mSin3A. 

We detected an interaction between HDAC1 and RbAp48 in vivo (Figure 11B, 
11C and Example 1). The low-stringency mSin3A immunoprecipitation shown in Figure 
11C also contained a protein that comigrated with RbAp48 (marked with an asterisk) that 
was not readily visible on the shorter exposures of low stringency mSin3A 
immunoprecipitations (Figure 11A). We have identified RbAp48 in mSin3A 
immunoprecipitates from cell extracts of nontransfected cells by western blotting, further 
demonstrating that mSin3A and RbAp48 associate in vivo (Figure 12A). 

To address further the association of HDAC1 and RbAp48 with mSin3A, 
we expressed the mammalian proteins in insect cells using recombinant baculoviruses. To 
this end, we expressed recombinant FLAG-epitope tagged HDAC1 (HDAC1-F) that 
could be immuno-purified by anti-FLAG antibodies and histidine-tagged mSin3A 
(mSin3A-H) that could be purified by nickel affinity (data not shown). HDAC1-F was 
immunoprecipitated from infected Sf9 cell extracts by anti-FLAG antibodies in the 
presence or absence of mSin3A-H. HDAC1-F was also precipitated by Ni2+-NTA 
agarose in a manner that was dependent on coexpression of mSin3A-H (Figure 12B), 
demonstrating that a complex between HDAC1 and mSin3A is formed in insect cells 
using exogenously expressed human proteins. 

Consistent with our finding that RbAp48 is associated with mSin3A and HDAC1 
in vivo, we show that baculovirus expressed Flu-epitope tagged RbAp48 (p48-HA) is 
specifically precipitated from infected Sf9 cell extracts using anti-FLAG antibody only 
when HDAC1-F is coexpressed. Furthermore, p48-HA is specifically retained by 
Ni2+-NTA in the presence of mSin3A-H (Figure 12C). Co-expression of p48-HA did not 
appear to affect the association between HDAC1-F and mSin3A-H, suggesting that the 
regions of interaction are distinct and that all three proteins can associate simultaneously. 
These data suggest a direct interaction between mSin3A, HDAC1 and RbAp48 in vivo. 

(iii) Transcription repression by mSin3A requires histone deacetylase activity. 

To investigate whether histone deacetylation plays a role in mSin3A-mediated 
transcriptional repression in vivo, we examined mSin3A-specific repression in the 
presence and absence of the histone deacetylase inhibitor trapoxin. 293 cells were 
transfected with a luciferase reporter gene construct containing a minimal promoter 
consisting of only a TATA box and initiation site derived from the myelomonocytic 
growth factor gene (Figure 13A). This reporter has four consensus binding sites for the 
DNA binding domain of the S. cerevisiae transcriptional activator GAL4 and therefore is 
responsive to chimeric proteins containing the GAL4 DNA binding domain (GALDBD) 
(Sterneck et al., 1992). We have used this reporter construct previously to demonstrate 
that fusion of the SID repressor region of Mad1 to the GALDBD is necessary for 
mSin3A-dependent transcriptional repression. Furthermore, we have shown that fusion 
of SID to the potent transcriptional activator GALVP16, MadN35GALVP16, can cancel 
the activation function of VP16 in an mSin3A-dependent manner. Consistent with our 
previous results (Ayer et al., 1996), MadN35GALVP16 activated transcription from the 
reporter gene approximately 100-fold less well than GALVP16 (data not shown). As a 
negative control we engineered two proline substitutions into the SID of Mad1, 
Mad(Pro); this protein cannot bind mSin3A in vitro (Ayer et al., 1995). Consistent with 
an inability to interact with mSin3A, Mad(Pro)GALVP16 is a much less potent repressor 
(Figure 13B). In control experiments we have shown that the observed effects require 
the presence of GAL4 sites in the promoter and that both MadN35GALVP16 and 
Mad(Pro)N35GALVP16 are expressed to equivalent levels in these cells and bind GAL4 
sites with similar affinities (data not shown). To test the role of histone deacetylation in 
the repression observed in our transfection assays, we first examined the effect of 
trapoxin on histone deacetylase activity in 293 cells. As expected, in vivo treatment with 
10 nM trapoxin for eight hours reduced deacetylase activity of both crude 293 extracts 
and anti-HDAC1 immunopurified complexes by approximately 46% and 58%, 
respectively (Figure 13C). 

To test the effect of a histone deacetylase inhibitor on MadN35GALVP16 and 
Mad(Pro)N35GALVP16 mediated repression, we treated a duplicate set of transfections 
with 10 nM trapoxin for eight hours prior to harvest. In the representative experiment 
shown, 10 nM trapoxin treatment derepressed the activity of MadN35GALVP16 
nine-fold while it had little effect on the activity of Mad(Pro)N35GALVP16, suggesting that 
histone deacetylation plays a direct role in mSin3A transcriptional repression (Figure 
13B). In addition, there was typically less than a two-fold effect of trapoxin on the 
activity of the reporter gene in cells transfected with the expression vector alone or in 
cells transfected with GALVP16 (data not shown). Following trapoxin treatment, the 
repression observed for MadN35GALVP16 was still seven times greater than that of 
Mad(Pro)N35GALVP16, suggesting that the residual deacetylase activity following 
trapoxin treatment (Figure 13B) continues to drive mSin3A-mediated repression; 
however, we cannot rule out that mSin3A is capable of repression by mechanisms 
independent of histone deacetylation. 
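
By way of illustration only, the fold-derepression and residual-repression figures given above can be expressed as simple ratios of normalized reporter activity. In the following sketch the function names fold_change and relative_repression are hypothetical, and the activity values are invented numbers chosen solely so that the ratios reproduce the nine-fold and seven-fold figures; they are not measured data, and the starting difference they imply is a consequence of those illustrative numbers rather than a reported result.

```python
def fold_change(treated, untreated):
    """Fold derepression: activity with trapoxin divided by activity without."""
    return treated / untreated


def relative_repression(reference_activity, repressor_activity):
    """How many times lower the repressor construct reads than the reference."""
    return reference_activity / repressor_activity


# Hypothetical normalized luciferase activities (arbitrary units), NOT measured data.
mad_untreated, mad_treated = 1.0, 9.0            # MadN35GALVP16 without / with trapoxin
madpro_untreated, madpro_treated = 63.0, 63.0    # Mad(Pro)N35GALVP16, assumed unchanged

derepression = fold_change(mad_treated, mad_untreated)          # nine-fold
residual = relative_repression(madpro_treated, mad_treated)     # seven-fold remains
initial = relative_repression(madpro_untreated, mad_untreated)  # implied by the numbers above

print(derepression, residual, initial)
```

Under the stated assumption that the mutant control is unaffected by trapoxin, the residual repression equals the initial repression divided by the fold derepression.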

Discussion 



Earlier studies implicated mSin3 as the primary candidate for the negative 
transcriptional function of the DNA binding transcription factor Mad (Ayer et al., 1995; 
Schreiber-Agus et al., 1995; Ayer et al., 1996). We present evidence that the mSin3A 
corepressor is part of a high molecular weight, multicomponent complex(es) that contains 
active histone deacetylase, thereby implicating histone deacetylation as a potential 
mechanism for mSin3A-mediated repression. Furthermore, we observe a pronounced 
increase in the transcriptional activity of an mSin3A-silenced reporter gene upon 
treatment in vivo with the specific histone deacetylase inhibitor trapoxin, suggesting that 
full transcriptional repression by mSin3A requires histone deacetylation. These results 
suggest a mechanism of gene regulation through the targeting of an enzyme that alters 
chromatin structure. 

These observations are consistent with genetic experiments in yeast, suggesting 
that the yeast orthologs of mSin3A and HDAC1, SIN3 and RPD3 respectively, are 
epistatic transcriptional regulators (Stillman et al., 1994). Furthermore, recent 
biochemical evidence demonstrates that Rpd3p is a component of a large molecular 
weight histone deacetylase complex in yeast (Rundlett et al., 1996). Together with our 
results, these findings predict a conservation of the mSin3/HDAC1 functional association 
in yeast. 

We have used chimeric transcriptional regulators to discern the effects of trapoxin 
on the activity of our reporter genes. The MadN35GALVP16 chimera functioned as a 
repressor by a mechanism that was dependent on the binding of mSin3A and that was 
sensitive to trapoxin. The same mutations that inactivate MadN35GALVP16 as a 
transcriptional repressor (i.e. Mad(Pro)N35GALVP16) also block interaction between 
Mad1 and mSin3A in vitro and Mad1 function in vivo. Therefore, it is likely that 
Mad:Max heterocomplexes repress transcription in a manner dependent on an 
mSin3A-associated histone deacetylase. 

By co-immunoprecipitation we have demonstrated that mSin3A and HDAC1 
associate in vivo. Consistent with these data we observed nuclear colocalization of 
mSin3A and HDAC1 by immunofluorescence microscopy (data not shown). Finally, 
overexpression in insect cells facilitates co-purification of mSin3A and HDAC1 (Figure 
12B and C), suggesting that the interaction between mSin3A and HDAC1 is either direct 
or requires a conserved cofactor. The finding that mSin3A has different associated 
histone deacetylases (HDAC1 and HDAC2) suggests that the mSin3A complex(es) may 
have multiple substrate or target specificities. The heterogeneous nature of the mSin3A 
complex potentially reflects a diverse array of repressors, histone deacetylases and 
different targeting molecules that facilitate mSin3A-dependent alterations in gene 
expression. 

At least five additional polypeptides are stably associated with mSin3A (Figures 
10 and 11); their function is currently unknown, but their tight association with mSin3A in 
both U937 cells and Jurkat T cells (data not shown) suggests that they in some way 
mediate mSin3A function. Furthermore, we have identified an association between 
mSin3A and RbAp48 in vivo, suggesting that this protein may play a role in regulating 
mSin3A-targeted deacetylation. RbAp48 was originally identified as a retinoblastoma 
binding protein that contains WD repeats and shares homology with the β-subunit of 
G-proteins (Qian et al., 1993). Subsequently, it has been shown that RbAp48 or its 
orthologs are involved in targeting different histone modifying enzymes to chromatin 
(Parthun et al., 1996; Taunton et al., 1996b; Tyler et al., 1996; Verreault et al., 1996). 
The mSin3A/RbAp48 complex isolated from U937 cells (Figure 11) is likely to represent 
only a small fraction of the mSin3A complexes, but its detection implies that mSin3A may 
play a role in the control of different aspects of chromatin physiology as well as 
transcription repression. 

It is unclear how different chromatin states facilitate transcription repression and 
activation or how their distinct biochemical states arise; however, there is ample 
cytological, genetic and biochemical evidence supporting the model that hyperacetylated 
chromatin is transcriptionally more active than hypoacetylated chromatin. Acetylation 
levels in β-heterochromatin of Drosophila melanogaster polytene chromosomes are 
significantly reduced at lysine positions 5, 8, and 16 of histone H4, while the 
transcriptionally hyperactive X-chromosome of male flies is uniquely hyperacetylated at 
position 16 (Turner et al., 1992). In yeast, mutation of acetyl-accepting lysines in histone 
H4 reduces the activity of the GAL1, PHO5 and CUP1 promoters in vivo (Durrin et al., 
1991). The transcriptionally silent regions in yeast, HML and HMR, are hypoacetylated 
and their activation is correlated with acetylation of histone H4 (Braunstein et al., 1993). 
Additionally, biochemical studies showed that certain transcription factors have higher 
affinity for their binding sites when those sites are embedded in chromatin assembled from 
hyperacetylated histones (Lee et al., 1993; Vettese-Dadey et al., 1996). Finally, evidence 
suggesting that acetylation is required for activation comes from the recent demonstration 
that several coactivators either encode acetyltransferases or are associated with 
acetyltransferases. Thus, our data support this general model for the control of gene 
expression by histone acetylation status and provide a biochemical mechanism for 
deacetylation-mediated repression. 

The acetylation status of a particular chromatin region represents a balance 
between competing acetylation and deacetylation reactions. We propose that 
MadN35GALVP16 recruits mSin3A-HDAC complexes to specific sites on DNA and 
shifts this equilibrium towards deacetylation and subsequent transcription repression by 
creating a high effective molarity of the histone deacetylase. In yeast, the activation 
domain of VP16 has been shown to use the acetyltransferase Gcn5p as a coactivator 
(Marcus et al., 1994; Brownell et al., 1996), suggesting that in mammalian cells VP16 
will also use an acetyltransferase as a cofactor. Thus, trapoxin treatment could shift the 
equilibrium from deacetylation to acetylation and thereby drive activation. 

Whether histone deacetylation will always have a negative effect on gene 
expression is unclear. Mutants in SIN3 and RPD3 can have both positive and negative 
effects on gene expression (Vidal and Gaber, 1991; Yoshimoto et al., 1992); however, 
for SIN3 there is evidence that positive effects may be indirect (Wang et al., 1994). In 
addition, mutations or deletions in RPD3 have recently been shown to enhance telomeric 
silencing both in yeast and in fruit fly (Sussel et al., 1995; De Rubertis et al., 1996; 
Rundlett et al., 1996). In mammalian cells, deacetylase inhibitors can inhibit MyoD- 
(Johnston et al., 1992) and steroid receptor-activated transcription (McKnight et al., 
1990; Bresnick et al., 1990). While it remains to be shown that the effects of RPD3 on 
silencing are direct, this evidence suggests that histone deacetylation can elicit both 
positive and negative effects on gene expression. Determining the factors that govern the 
functional outcome of histone deacetylation will provide fertile ground for further 
experimentation. 


Experimental Procedures 

Antibodies, cell culture, and immunoprecipitations: To generate antiserum 
specific for mSin3A, a GST fusion protein encoding amino acids 251 through 405 of 
mSin3A was used to immunize a New Zealand White rabbit. The crude serum was 
passed over a GST column to remove the anti-GST antibodies. U937 cells were grown 
in RPMI supplemented with 10% calf serum (Hyclone), glutamine and 
penicillin-streptomycin. Low and high stringency immunoprecipitations were performed 
essentially as described (Ayer and Eisenman, 1993). To elute proteins from low stringency 
immunoprecipitates, the immunoprecipitates were incubated for 60 minutes at room 
temperature in antibody buffer, and the eluted proteins were reprecipitated under high 
stringency conditions. 

Luciferase assays: 293 cells were seeded in triplicate onto 60 mm dishes at 3x10^ 
cells in 4 ml DME with 10% calf serum (Hyclone). Six hours after seeding, cells were 
transfected with 50 ng luciferase reporter, 50 ng CMV-β-gal, 50 ng expression construct, 
and 2.85 ng carrier DNA using the BBS/CaPO4 method. 10 nM trapoxin was added to 
the media 8 hours prior to the luciferase assays. Cell lysates were prepared 20-24 hours 
following transfections, and luciferase and β-galactosidase activities were assayed 
according to manufacturer directions (Promega, Tropix). Luciferase values (relative light 
units) were normalized for transfection efficiency by dividing by β-gal activity. 
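
The normalization step described above can be sketched as follows, for illustration only. The function names normalize and mean_normalized, the pairing of one β-gal reading with each luciferase reading, and the averaging of triplicate dishes are assumptions made for this sketch; the readings shown are invented values, not data from the experiments.

```python
from statistics import mean


def normalize(luciferase_rlu, beta_gal_activity):
    """Divide each luciferase reading (relative light units) by the
    beta-gal activity measured for the same dish."""
    if len(luciferase_rlu) != len(beta_gal_activity):
        raise ValueError("need one beta-gal value per luciferase value")
    return [rlu / gal for rlu, gal in zip(luciferase_rlu, beta_gal_activity)]


def mean_normalized(luciferase_rlu, beta_gal_activity):
    """Average the normalized values across replicate dishes (assumed triplicates)."""
    return mean(normalize(luciferase_rlu, beta_gal_activity))


# Hypothetical triplicate readings, NOT measured data.
rlu = [1200.0, 1500.0, 900.0]
gal = [2.0, 2.5, 1.5]
print(mean_normalized(rlu, gal))  # 600.0
```

Dividing by β-gal activity before averaging means that a dish with poor transfection efficiency does not drag down the reported reporter activity.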

Histone deacetylase assays: In vitro histone deacetylase activity was assayed 
essentially as described with either 50 µl of crude cell extract (approximately 5X10* 
cells) or immunopurified cell extracts (approximately 2 X 10' cells) for 2.5 hrs at 37°C 
(Taunton et al., 1996b). Pretreatment of crude or immunopurified extract with synthetic 
trapoxin was performed for 30 minutes at 4°C prior to addition of peptide substrate. TAg 
Jurkat and 293 cell extracts for histone deacetylase assays were prepared as in Taunton et 
al., 1996. Anti-HDAC1 and anti-mSin3A immunoprecipitations were performed as 
described above and in Figure 11. The Protein-A conjugated immunoprecipitates were 
washed three times in J-buffer plus 1 mM EDTA and resuspended in J-buffer without 
Triton-X-100, and histone deacetylase activity was measured as described. 

Baculoviruses: cDNAs encoding Flag-tagged HDAC1, HA-tagged RbAp48 and 
His-tagged mSin3A were cloned into the transfer vector pVL 1392 (specific details on 
the construction of these vectors are available upon request). Recombinant virus was 
generated using Baculogold DNA according to the manufacturer's instructions 
(Pharmingen). Sf9 or High 5 cells were infected at high multiplicity, extracts were prepared 
48 hours post infection, and immunoprecipitations were performed as described above. 
Ni2+-NTA agarose and anti-Flag antibody were purchased from Qiagen and Kodak-IBI, 
respectively. 

Zervos, A.S., Gyuris, J., and Brent, R. (1993). Mxi1, a protein that specifically interacts 
with Max to bind Myc-Max recognition sites. Cell 72, 223-232. 

Example 4 

Since the priority date of this application, a number of other mammalian HDx 
genes have been described in the literature. In particular, a mouse HD1 clone is 
identified in GenBank as accession number U807080. Another HDx member, HD2 
(HDAC-2), is also described for both human and mouse; see, for example, GenBank 
entries U31814 and U31758. Without exception, each clone includes a v motif 
represented in the general formula of SEQ ID No. 12, and a x motif represented in the 
general formula SEQ ID No. 14. 



All of the above-cited references and publications are hereby incorporated by 
reference. 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, numerous equivalents to the specific polypeptides, nucleic acids, 
methods, assays and reagents described herein. Such equivalents are considered to be 
within the scope of this invention and are covered by the following claims. 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Schreiber, Stuart L. 
Taunton, Jack 
Hassig, Christian A. 
Jamison, Timothy F. 

(ii) TITLE OF INVENTION: Histone Deacetylases and Uses Related Thereto 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 
(A) ADDRESSEE: FOLEY, HOAG & ELIOT, LLP 
(B) STREET: One Post Office Square 
(C) CITY: Boston 
(D) STATE: MA 
(E) COUNTRY: USA 
(F) ZIP: 02109 

(v) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Floppy disk 
(B) COMPUTER: IBM PC compatible 
(C) OPERATING SYSTEM: PC-DOS/MS-DOS 
(D) SOFTWARE: ASCII (text) 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: US 
(B) FILING DATE: 26-MAR-1996 
(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: Vincent, Matthew P. 
(B) REGISTRATION NUMBER: 36,709 
(C) REFERENCE/DOCKET NUMBER: HUV019.25 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (617) 332-1000 
(B) TELEFAX: (617) 832-7000 



(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1449 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: both 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 
(A) NAME/KEY: CDS 
(B) LOCATION: 1..1446 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 



10 ^It IT: rtt Ty rTr r r ^« 

^ inr Gin Gly Thr Arg Arg Lys Val Cys Tyr Tyr Tyr Aap 



15 



15 



S?y ASP vll ttt f ^ f ^ ^'^^ ^ ^^^^ ATG AAG CCT 

Gly Asp val Gly Asa Tyr Tyr Tyr Gly Gin Gly His Pro Met Lys Pro 

25 3Q 

2s So ?r ^^"^ '''''' ^''^ ^^'^ TAT GGT CTC TAC 

Hxs Arg lie Arg Met Thr His Asn Leu Leu Leu Asn Tyr Gly LeS Jyr 

20 *° 45 

CGA AAA ATG GAA ATC TAT CGC CCT CAC AAA GCC AAT GCT GAG GAG ATG 
Arg Lys Met Glu lie Tyr Arg Pro His Lys Ala Asn Ala G^u Met 



55 60 



25 



?hr JJs Sr 2e f ^ ^^^'^ '^^'^ ^ CGT 

Thr Lys Tyr H.a Ser Asp Aap Tyr He Lys Phe Leu Arg Ser He Arg 

75 80 



85 90 



95 



35 



G?3 li? I '''''' ^""'^ ^"""^ GAG TTC TGT CAG TTG 

Gly Glu Asp Cys Pro Val Phe Asp Gly Leu Phe Glu Phe Cys Gin III 

105 110 

S^r ITy G?v III f 1 ^'^^ '^^^ ^ ^A*^ CAG 

r Thr Gly Gly Ser Val Ala Ser Ala Val Lys Leu Asn Lys Gin Gin 

40 ^20 125 

?hr aJp rlt IT "^^^ ^-^^ ^G AAG 

Thr Asp He Ala Val Asn Trp Ala Gly Gly Leu His His Ala Lys Lys 

135 140 

m ^f"" '''''^ "^"^ AAT GAT ATC GTC TTG GCC ATC 

ser Glu Ala Ser Gly Phe Cys Tyr Val Asn Asp He VaJ Hu lit zH 

155 160 

50 ?T ^""^ CAG AGG GTG CTG TAC ATT GAC ATT GAT 

50 Leu Glu Leu Leu Lys Tyr His Gin Arg Val Leu Tyr H^ Ssp fll Asp 

170 175 

tH l^l S?! °CC TTC TAC ACC ACG GAC CGG 

Thr 
190 



lie His nil rT r , ^ ""^^ "^^^ ACC ACG GAC CGG 

55 " ""^P ^'i' '^^^ Ala Phe Tyr Thr Thr Asp Arg 

180 135 



96 



144 



192 



240 



30 P^o ASP M^t ser f ' ^BS 

ro Asp Asn Met Ser Glu Tyr Ser Lys Gin Met Gin Arg Phe Asn Val 



336 



384 



432 



480 



576 

GTC ATG 
Val Met 



GGG GAC 
Gly Asp 
210 

AAC TAG 
Asn Tyr 
225 

TTC AAG 
Phe Lys 



ACT GTG TCC TTT 
Thr Val Ser Phe 
195 

CTA GGG GAT ATC 
Leu Arg Asp lie 



GTG GTC 
Val Val 



TGC TTC 
Cys Phe 



AAG AGC 
Lys Ser 
290 

ATT CGT 
lie Arg 
305 

GAT ACG 
Asp Thr 



CCG CTC CGA GAC 
Pro Leu Arg Asp 
230 

CCG GTC ATG TCC 
Pro Val Met Ser 
245 

TTA CAG TGT GGC 
Leu Gin Cys Gly 
260 

AAT CTA ACT ATC 
Asn Leu Thr He 
275 

TTT AAC CTG CCT 
Phe Asn Leu Pro 



CAT AAG 
His Lys 
200 

GGG GCT 
Gly Ala 
215 

GGG ATT 
Gly He 



TAT GGA GAG TAG 
Tyr Gly Glu Tyr 



AAA GTA 
Lys Val 



TCA GAC 
Ser Asp 



AAC GTT 
Asn Val 



GAG ATC 
Glu He 



TTT GGA 
Phe Gly 



CAG AAC 
Gin Aen 



AAC CTT 
Asn Leu 
370 

CCT GAG 
Pro Glu 
385 

CCT GAC 
Pro Asp 



CCA GAT 
Pro Asp 
340 

ACG AAT 
Thr Asn 
355 

AGA ATG 
Arg Met 



GCC CGG 
Ala Arg 
310 

CCT AAT 
Pro Asn 
325 

TTC AAG 
Phe Lys 



AAA GGA 
Lys Gly 
280 

ATG CTG 
Met Leu 
295 

TGC TGG 
Cys Trp 



GAG CTT 
Glu Leu 



GGC AAA GGC AAG 
Gly Lys Gly Lys 
220 

GAT GAC GAG TCC 
Asp Asp Glu Ser 
235 

ATG GAG ATG TTC 
Met Glu Met Phe 
250 

TCC CTA TCT GGG 
Ser Leu Ser Gly 
265 

CAC GCC AAG TGT 
His Ala Lys Cys 



TTC CCA GGA ACT 
Phe Pro Gly Thr 
205 

TAT TAT GCT GTT 
Tyr Tyr Ala Val 



ATG CTG GGA GGC 
Met Leu Gly Gly 
300 

ACA TAT GAG ACA 
Thr Tyr Glu Thr 
315 

CCA TAG AAT GAC 
Pro Tyr Asn Asp 
330 



TAT GAG GCC ATT 
Tyr Glu Ala He 
240 

CAG CCT AGT GGG 
Gin Pro Ser Ala 
255 

GAT CGG TTA GGT 
Asp Arg Leu Gly 
270 

GTG GAA TTT GTC 
Val Glu Phe Val 
285 

GGT GGT TAG ACC 
Gly Gly Tyr Thr 



GCT GTG GCC CTG 
Ala Val Ala Leu 
320 

TAG TTT GAA TAG 
Tyr Phe Glu Tyr 
335 



CTC CAC 
Leu His 



GAG TAG 
Glu Tyr 



CTG CCG 
Leu Pro 



GAC GCC ATC CCT 
Asp Ala He Pro 
390 

AAG CGG ATC TCG 
Lys Arg He Ser 



CTG GAG 
Leu Glu 
360 

CAC GGA 
His Ala 
375 

GAG GAG 
Glu Glu 



ATC TGC 
He Cys 



ATC AGT CCT TCC 
He Ser Pro Ser 
345 

AAG ATC AAA CAG 
Lys He Lys Gin 



CCT GGG GTC CAA 
Pro Gly Val Gin 
380 

AGT GGC GAT GAG 
Ser Gly Asp Glu 
395 

TCC TCT GAC AAA 
Ser Ser Asp Lys 



AAT ATG ACT AAC 
Asn Met Thr Asn 
350 

CGA CTG TTT GAG 
Arg Leu Phe Glu 
365 

ATG CAG GCG ATT 
Met Gin Ala He 



GAC GAA GAC GAC 
Asp Glu Asp Asp 
400 

CGA ATT GCC TGT 
Arg He Ala Cys 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



nrin- ^wn 
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405 410 

GAG GAA GAG TTC TCC GAT TCT GAA GAG GAG GGA GAG GGG GGC CGC AAG 1296 
Glu Glu Glu Phe Ser Asp Ser Glu Glu Glu Gly Glu Gly Gly Arg Lys 
420 425 430 

AAC TCT TCC AAC TTC AAA AAA GCC AAG AGA GTC AAA ACA GAG GAT GAA 1344 
Asn Ser Ser Asn Phe Lys Lys Ala Lys Arg Val Lye Thr Glu Asp Glu 
435 440 

AAA GAG AAA GAC CCA GAG GAG AAG AAA GAA GTC ACC GAA GAG GAG AAA 1392 
Lys Glu Lys Asp Pro Glu Glu Lys Lys Glu Val Thr Glu Glu Glu Lys 
450 455 460 



15 ACC AAG GAG GAG AAG CCA GAA GCC AAA GGG GTC AAG GAG GAG GTC AAG 
Thr Lys Glu Glu Lys Pro Glu Ala Lys Gly Val Lys Glu Glu Val Lvs 

470 475 480 



1440 



TTG GCC TGA 

20 Leu Ala ^^'^^ 



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 379 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

ATTGACTTCC TGCAGAGAGT CAGCCCCACC AATATGCAAG GCTTCACCAA GAGTCTTAAT 60

GCCTTCAACG TAGGCGATGA CTGCCCAGTG TTTCCCGGGC TCTTTGAGTT CTGCTCGCGT 120

TACACAGGCG CATCTCTGCA AGGAGCAACC CAGCTGAACA ACAAGATCTG TGATATTGCC 180

ATTAACTGGG CTGGTGGTCT GCACCATGCC TAGAAGTTTG AGGCCTCTGG CTTCTGCTAT 240

GTCAACGACA TTGTGTTTGG CATCCTGGAG CTGCTCAAGT ACCACCCTCG GGTGCTCTAC 300

ATTGACATTG ACATCCACCA TGGTGACGGG GTTCAAGAAG CTTTCTACCT CACTGACCGG 360

GTCATGACGG TGTCCTTTC 379
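The relationship between the cDNA fragment of SEQ ID NO: 2 and the peptide of SEQ ID NO: 6 can be checked by translating the nucleotide rows in reading frame 1. The following is a minimal sketch, assuming the standard genetic code; the names `CODON_TABLE`, `translate` and `FIRST_ROW` are illustrative and do not come from the specification. It translates only the first 60 nucleotides of SEQ ID NO: 2, which should correspond to residues 1-20 of SEQ ID NO: 6.

```python
from itertools import product

# Standard genetic code, written in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First row (nucleotides 1-60) of SEQ ID NO: 2 as printed in the listing.
FIRST_ROW = "ATTGACTTCC TGCAGAGAGT CAGCCCCACC AATATGCAAG GCTTCACCAA GAGTCTTAAT"

print(translate(FIRST_ROW))  # IDFLQRVSPTNMQGFTKSLN
```

The one-letter output reads Ile Asp Phe Leu Gln Arg Val Ser Pro Thr Asn Met Gln Gly Phe Thr Lys Ser Leu Asn, which is how SEQ ID NO: 6 begins.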




(2) INFORMATION FOR SEQ ID NO: 3: 











(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 375 base pairs



wo 97/35990 



PCT/US97/05275 



-115- 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TACTACTGTC TGAACGTGCC CCTGCGGATG GGCATTGATG ACCAGAGTTA CAAGCACCTT 60 

TTCCAGCCGG TTATCAACCA GGTAGTGGAC TTCTACCAAC CCACGTGCAT TGTGCTCCAG 120 

TGTGGAGCTG ACTCTCTGGG CTGTGATCGA TTGGGCTGCT TTAACCTCAG CATCCGAGGG 180

CATGGGGAAT GCGTTGAATA TGTCAAGAGC TTCAATATCC CTCTACTCGT GCTGGGTGGT 240 

GGTGGTTATA CTGTCCGAAA TGTTGCCCGC TGCTGGACAT ATGAGACATC GCTGCTGGTA 300 

GAAGAGGCCA TTAGTGAGGA GCTTCCCTAT AGTGAATACT TCGAGTACTT TGCCCCAGAC 360

TTCACACTTC ATCCA 375

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 227 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

GGTCATGCTA AATGTGTAGA AGTTGTAAAA ACTTTTAACT TACCATTACT GATGCTTGGA 60 

GGAGGTGGCT ACACAATCCG TAATGTTGCT CGATGTTGGA CATATGAGAC TGCAGTTGCC 120 

CTTGATTGTG AGATTCCCAA TGAGTTGCCA TATAATGATT ACTTTGAGTA TTTTGGACCA 180 

GACTTCAAAC TGCATATTAG TCCTTCAAAC ATGACAAACC AGAACAC 227
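Each nucleotide row in the listing ends with a running base count, so a transcription can be checked against the declared LENGTH. The sketch below is one possible way to do this for SEQ ID NO: 4; the function name `check_listing` and the variable names are illustrative assumptions, not part of the specification.

```python
# Rows of SEQ ID NO: 4 as printed: blocks of ten bases followed by a running total.
LISTING = """\
GGTCATGCTA AATGTGTAGA AGTTGTAAAA ACTTTTAACT TACCATTACT GATGCTTGGA 60
GGAGGTGGCT ACACAATCCG TAATGTTGCT CGATGTTGGA CATATGAGAC TGCAGTTGCC 120
CTTGATTGTG AGATTCCCAA TGAGTTGCCA TATAATGATT ACTTTGAGTA TTTTGGACCA 180
GACTTCAAAC TGCATATTAG TCCTTCAAAC ATGACAAACC AGAACAC 227
"""


def check_listing(text: str) -> str:
    """Join the base blocks and verify every row against its running total."""
    sequence = ""
    for line in text.strip().splitlines():
        *blocks, running_total = line.split()
        sequence += "".join(blocks)
        if len(sequence) != int(running_total):
            raise ValueError(
                f"row labelled {running_total} ends at base {len(sequence)}"
            )
    return sequence


seq_id_4 = check_listing(LISTING)
print(len(seq_id_4))  # 227, matching the declared LENGTH
```

A row that lost or gained a base during transcription raises `ValueError` at the first row whose total no longer agrees.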

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 482 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Met Ala Gln Thr Gln Gly Thr Arg Arg Lys Val Cys Tyr Tyr Tyr Asp

Gly Asp Val Gly Asn Tyr Tyr Tyr Gly Gln Gly His Pro Met Lys Pro
25 30

His Arg Ile Arg Met Thr His Asn Leu Leu Leu Asn Tyr Gly Leu Tyr
40 45

Arg Lys Met Glu Ile Tyr Arg Pro His Lys Ala Asn Ala Glu Glu Met
55 60

Thr Lys Tyr His Ser Asp Asp Tyr Ile Lys Phe Leu Arg Ser Ile Arg
70 75 80

Pro Asp Asn Met Ser Glu Tyr Ser Lys Gln Met Gln Arg Phe Asn Val
85 90 95

Gly Glu Asp Cys Pro Val Phe Asp Gly Leu Phe Glu Phe Cys Gln Leu
100 105 110

Ser Thr Gly Gly Ser Val Ala Ser Ala Val Lys Leu Asn Lys Gln Gln
120 125

Thr Asp Ile Ala Val Asn Trp Ala Gly Gly Leu His His Ala Lys Lys
140

Ser Glu Ala Ser Gly Phe Cys Tyr Val Asn Asp Ile Val Leu Ala Ile
155 160

Leu Glu Leu Leu Lys Tyr His Gln Arg Val Leu Tyr Ile Asp Ile Asp
170 175

Ile His His Gly Asp Gly Val Glu Glu Ala Phe Tyr Thr Thr Asp Arg
180 185 190

Val Met Thr Val Ser Phe His Lys Tyr Gly Glu Tyr Phe Pro Gly Thr
200 205

Gly Asp Leu Arg Asp Ile Gly Ala Gly Lys Gly Lys Tyr Tyr Ala Val
215 220

Asn Tyr Pro Leu Arg Asp Gly Ile Asp Asp Glu Ser Tyr Glu Ala Ile
225 230 235 240

Phe Lys Pro Val Met Ser Lys Val Met Glu Met Phe Gln Pro Ser Ala
245 250 255

Val Val Leu Gln Cys Gly Ser Asp Ser Leu Ser Gly Asp Arg Leu Gly
265 270

Cys Phe Asn Leu Thr Ile Lys Gly His Ala Lys Cys Val Glu Phe Val
275 280 285

Lys Ser Phe Asn Leu Pro Met Leu Met Leu Gly Gly Gly Gly Tyr Thr
290 295 300

Ile Arg Asn Val Ala Arg Cys Trp Thr Tyr Glu Thr Ala Val Ala Leu
310 315 320

Asp Thr Glu Ile Pro Asn Glu Leu Pro Tyr Asn Asp Tyr Phe Glu Tyr
325 330 335

Phe Gly Pro Asp Phe Lys Leu His Ile Ser Pro Ser Asn Met Thr Asn
340 345 350

Gln Asn Thr Asn Glu Tyr Leu Glu Lys Ile Lys Gln Arg Leu Phe Glu
355 360 365

Asn Leu Arg Met Leu Pro His Ala Pro Gly Val Gln Met Gln Ala Ile
370 375 380

Pro Glu Asp Ala Ile Pro Glu Glu Ser Gly Asp Glu Asp Glu Asp Asp
390 395 400

Pro Asp Lys Arg Ile Ser Ile Cys Ser Ser Asp Lys Arg Ile Ala Cys
405 410 415

Glu Glu Glu Phe Ser Asp Ser Glu Glu Glu Gly Glu Gly Gly Arg Lys
420 425 430

Asn Ser Ser Asn Phe Lys Lys Ala Lys Arg Val Lys Thr Glu Asp Glu
435 440 445

Lys Glu Lys Asp Pro Glu Glu Lys Lys Glu Val Thr Glu Glu Glu Lys
450 455 460

Thr Lys Glu Glu Lys Pro Glu Ala Lys Gly Val Lys Glu Glu Val Lys
470 475 480

Leu Ala

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Ile Asp Phe Leu Gln Arg Val Ser Pro Thr Asn Met Gln Gly Phe Thr
5 10 15

Lys Ser Leu Asn Ala Phe Asn Val Gly Asp Asp Cys Pro Val Phe Pro
20 25 30

Gly Leu Phe Glu Phe Cys Ser Arg Tyr Thr Gly Ala Ser Leu Gln Gly
40 45

Ala Thr Gln Leu Asn Asn Lys Ile Cys Asp Ile Ala Ile Asn Trp Ala
55 60

Gly Gly Leu His His Ala Lys Lys Phe Glu Ala Ser Gly Phe Cys Tyr
70 75 80

Val Asn Asp Ile Val Phe Gly Ile Leu Glu Leu Leu Lys Tyr His Pro
85 90 95

Arg Val Leu Tyr Ile Asp Ile Asp Ile His His Gly Asp Gly Val Gln
105 110

Glu Ala Phe Tyr Leu Thr Asp Arg Val Met Thr Val Ser Phe Pro Gln
120 125

Ile Arg Glu Ile Tyr
130

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 125 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Tyr Tyr Cys Leu Asn Val Pro Leu Arg Met Gly Ile Asp Asp Gln Ser
1 5 10 15

Tyr Lys His Leu Phe Gln Pro Val Ile Asn Gln Val Val Asp Phe Tyr
20 25 30

Gln Pro Thr Cys Ile Val Leu Gln Cys Gly Ala Asp Ser Leu Gly Cys
35 40 45

Asp Arg Leu Gly Cys Phe Asn Leu Ser Ile Arg Gly His Gly Glu Cys
50 55 60

Val Glu Tyr Val Lys Ser Phe Asn Ile Pro Leu Leu Val Leu Gly Gly
70 75 80

Gly Gly Tyr Thr Val Arg Asn Val Ala Arg Cys Trp Thr Tyr Glu Thr
85 90 95

Ser Leu Leu Val Glu Glu Ala Ile Ser Glu Glu Leu Pro Tyr Ser Glu
100 105 110

Tyr Phe Glu Tyr Phe Ala Pro Asp Phe Thr Leu His Pro
115 120 125

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 80 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Asn Leu Leu Val Leu Gly His Ala Lys Cys Val Glu Val Val Lys Thr
1 5 10 15

Phe Asn Leu Pro Leu Leu Met Leu Gly Gly Gly Gly Tyr Thr Ile Arg
20 25 30

Asn Val Ala Arg Cys Trp Thr Tyr Glu Thr Ala Val Ala Leu Asp Cys
35 40 45

Glu Ile Pro Asn Glu Leu Pro Tyr Asn Asp Tyr Phe Glu Tyr Phe Gly
50 55 60

Pro Asp Phe Lys Leu His Ile Ser Pro Ser Asn Met Thr Asn Gln Asn
65 70 75 80
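The peptide fragments of SEQ ID NO: 7 and SEQ ID NO: 8 share a conserved stretch running from a His residue to the Trp-Thr-Tyr-Glu-Thr motif (residues 61-96 and 7-42, respectively). A simple percent-identity figure for two such pre-aligned, equal-length segments can be sketched as below. This is an illustration only: the specification does not state how "percent homologous" is to be computed, and the function and variable names are assumptions introduced here.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of positions holding the same residue in two aligned, gap-free segments."""
    if len(a) != len(b):
        raise ValueError("segments must be aligned to the same length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# One-letter transcriptions of the listed residues.
SEQ7_SEGMENT = "HGECVEYVKSFNIPLLVLGGGGYTVRNVARCWTYET"  # SEQ ID NO: 7, residues 61-96
SEQ8_SEGMENT = "HAKCVEVVKTFNLPLLMLGGGGYTIRNVARCWTYET"  # SEQ ID NO: 8, residues 7-42

print(round(percent_identity(SEQ7_SEGMENT, SEQ8_SEGMENT), 1))  # 80.6 (29 of 36 positions)
```

A full comparison would first align the sequences and decide how gaps and conservative substitutions are scored; this sketch deliberately skips both steps.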



(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1275 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..1275

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

ATG GCC GAC AAG GAA GCA GCC TTC GAC GAC GCA GTG GAA GAA CGA GTG 48
Met Ala Asp Lys Glu Ala Ala Phe Asp Asp Ala Val Glu Glu Arg Val
1 5 10 15

ATC AAC GAG GAA TAC AAA ATA TGG AAA AAG AAC ACC CCT TTT CTT TAT 96
Ile Asn Glu Glu Tyr Lys Ile Trp Lys Lys Asn Thr Pro Phe Leu Tyr
20 25 30

GAT TTG GTG ATG ACC CAT GCT CTG GAG TGG CCC AGC CTA ACT GCC CAG 144
Asp Leu Val Met Thr His Ala Leu Glu Trp Pro Ser Leu Thr Ala Gln
35 40 45

TGG CTT CCA GAT GTA ACC AGA CCA GAA GGG AAA GAT TTC AGC ATT CAT 192
Trp Leu Pro Asp Val Thr Arg Pro Glu Gly Lys Asp Phe Ser Ile His
50 55 60

CGA CTT GTC CTG GGG ACA CAC ACA TCG GAT GAA CAA AAC CAT CTT GTT 240
Arg Leu Val Leu Gly Thr His Thr Ser Asp Glu Gln Asn His Leu Val
70 75 80

ATA GCC AGT GTG CAG CTC CCT AAT GAT GAT GCT CAG TTT GAT GCG TCA 288
Ile Ala Ser Val Gln Leu Pro Asn Asp Asp Ala Gln Phe Asp Ala Ser
85 90 95

CAC TAC GAC AGT GAG AAA GGA GAA TTT GGA GGT TTT GGT TCA GTT AGT 336
His Tyr Asp Ser Glu Lys Gly Glu Phe Gly Gly Phe Gly Ser Val Ser
100 105 110

GGA AAA ATT GAA ATA GAA ATC AAG ATC AAC CAT GAA GGA GAA GTA AAC 384
Gly Lys Ile Glu Ile Glu Ile Lys Ile Asn His Glu Gly Glu Val Asn
115 120 125

AGG GCC CGT TAT ATG CCC CAG AAC CCT TGT ATC ATC GCA ACA AAG ACT 432
Arg Ala Arg Tyr Met Pro Gln Asn Pro Cys Ile Ile Ala Thr Lys Thr
130 135 140

CCT TCC AGT GAT GTT CTT GTC TTT GAC TAT ACA AAA CAT CCT TCT AAA 480
Pro Ser Ser Asp Val Leu Val Phe Asp Tyr Thr Lys His Pro Ser Lys
150 155 160

CCA GAT CCT TCT GGA GAG TGC AAC CCA GAC TTG CGT CTC CGT GGA CAT 528
Pro Asp Pro Ser Gly Glu Cys Asn Pro Asp Leu Arg Leu Arg Gly His
165 170 175

CAG AAG GAA GGC TAT GGG CTT TCT TGG AAC CCA AAT CTC AGT GGG CAC 576
Gln Lys Glu Gly Tyr Gly Leu Ser Trp Asn Pro Asn Leu Ser Gly His
180 185 190

TTA CTT AGT GCT TCA GAT GAC CAT ACC ATC TGC CTG TGG GAC ATC AGT 624
Leu Leu Ser Ala Ser Asp Asp His Thr Ile Cys Leu Trp Asp Ile Ser
195 200 205

GCC GTT CCA AAG GAG GGA AAA GTG GTA GAT GCG AAG ACC ATC TTT ACA 672
Ala Val Pro Lys Glu Gly Lys Val Val Asp Ala Lys Thr Ile Phe Thr
210 215 220

GGG CAT ACG GCA GTA GTA GAA GAT GTT TCC TGG CAT CTA CTC CAT GAG 720
Gly His Thr Ala Val Val Glu Asp Val Ser Trp His Leu Leu His Glu
225 230 235 240

TCT CTG TTT GGG TCA GTT GCT GAT GAT CAG AAA CTT ATG ATT TGG GAT 768
Ser Leu Phe Gly Ser Val Ala Asp Asp Gln Lys Leu Met Ile Trp Asp
245 250 255

ACT CGT TCA AAC AAT ACT TCC AAA CCA AGC CAC TCA GTT GAT GCT CAC 816
Thr Arg Ser Asn Asn Thr Ser Lys Pro Ser His Ser Val Asp Ala His
260 265 270

ACT GCT GAA GTG AAC TGC CTT TCT TTC AAT CCT TAT AGT GAG TTC ATT 864
Thr Ala Glu Val Asn Cys Leu Ser Phe Asn Pro Tyr Ser Glu Phe Ile
275 280 285

CTT GCC ACA GGA TCA GCT GAC AAG ACT GTT GCC TTG TGG GAT CTG AGA 912
Leu Ala Thr Gly Ser Ala Asp Lys Thr Val Ala Leu Trp Asp Leu Arg
290 295 300

AAT CTG AAA CTT AAG TTG CAT TCC TTT GAG TCA CAT AAG GAT GAA ATA 960
Asn Leu Lys Leu Lys Leu His Ser Phe Glu Ser His Lys Asp Glu Ile
305 310 315 320

TTC CAG GTT CAG TGG TCA CCT CAC AAT GAG ACT ATT TTA GCT TCC AGT 1008
Phe Gln Val Gln Trp Ser Pro His Asn Glu Thr Ile Leu Ala Ser Ser
325 330 335

GGT ACT GAT CGC AGA CTG AAT GTC TGG GAT TTA AGT AAA ATT GGA GAG 1056
Gly Thr Asp Arg Arg Leu Asn Val Trp Asp Leu Ser Lys Ile Gly Glu
340 345 350

GAA CAA TCC CCA GAA GAT GCA GAA GAC GGG CCA CCA GAG TTG TTG TTT 1104
Glu Gln Ser Pro Glu Asp Ala Glu Asp Gly Pro Pro Glu Leu Leu Phe
355 360 365

ATT CAT GGT GGT CAT ACT GCC AAG ATA TCT GAT TTC TCC TGG AAT CCC 1152
Ile His Gly Gly His Thr Ala Lys Ile Ser Asp Phe Ser Trp Asn Pro
370 375 380

AAT GAA CCT TGG GTG ATT TGT TCT GTA TCA GAA GAC AAT ATC ATG CAA 1200
Asn Glu Pro Trp Val Ile Cys Ser Val Ser Glu Asp Asn Ile Met Gln
385 390 395 400

GTG TGG CAA ATG GCA GAG AAC ATT TAT AAT GAT GAA GAC CCT GAA GGA 1248
Val Trp Gln Met Ala Glu Asn Ile Tyr Asn Asp Glu Asp Pro Glu Gly
405 410 415

AGC GTG GAT CCA GAA GGA CAA GGG TCC TAG 1278
Ser Val Asp Pro Glu Gly Gln Gly Ser
420

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 69 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Asp Xaa Xaa Xaa Asn Xaa Xaa Gly Gly Leu His His Ala Lys Lys Xaa
1 5 10 15

Glu Ala Ser Gly Phe Cys Tyr Xaa Asn Asp Ile Val Xaa Xaa Ile Xaa
20 25 30

Glu Leu Leu Xaa Tyr His Xaa Arg Val Xaa Tyr Ile Asp Xaa Asp Xaa
35 40 45

His His Gly Asp Gly Xaa Glu Glu Ala Phe Tyr Xaa Thr Asp Arg Val
50 55 60

Met Thr Xaa Ser Phe
65



(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 69 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Asp Ile Ala Xaa Asn Trp Ala Gly Gly Leu His His Ala Lys Lys Xaa
1 5 10 15

Glu Ala Ser Gly Phe Cys Tyr Val Asn Asp Ile Val Xaa Xaa Ile Leu
20 25 30

Glu Leu Leu Lys Tyr His Xaa Arg Val Leu Tyr Ile Asp Ile Asp Ile
35 40 45

His His Gly Asp Gly Xaa Glu Glu Ala Phe Tyr Xaa Thr Asp Arg Val
50 55 60

Met Thr Val Ser Phe
65
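The consensus sequences in this listing are given in three-letter code with Xaa for an unspecified residue, whereas general formulas are usually written in one-letter code with X. The conversion can be sketched as follows, using SEQ ID NO: 13 as input. The mapping is the standard IUPAC one; the names `THREE_TO_ONE`, `to_one_letter` and `SEQ_ID_13` are illustrative and not taken from the specification.

```python
# Standard three-letter to one-letter amino acid codes, plus Xaa for "any residue".
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}

SEQ_ID_13 = """
Asp Ile Ala Xaa Asn Trp Ala Gly Gly Leu His His Ala Lys Lys Xaa
1 5 10 15
Glu Ala Ser Gly Phe Cys Tyr Val Asn Asp Ile Val Xaa Xaa Ile Leu
20 25 30
Glu Leu Leu Lys Tyr His Xaa Arg Val Leu Tyr Ile Asp Ile Asp Ile
35 40 45
His His Gly Asp Gly Xaa Glu Glu Ala Phe Tyr Xaa Thr Asp Arg Val
50 55 60
Met Thr Val Ser Phe
65
"""


def to_one_letter(listing: str) -> str:
    """Convert a three-letter listing to one-letter code, skipping position numbers."""
    tokens = [t for t in listing.split() if not t.isdigit()]
    return "".join(THREE_TO_ONE[t] for t in tokens)


one_letter = to_one_letter(SEQ_ID_13)
print(len(one_letter))  # 69, matching the declared LENGTH
print(one_letter)
```

An unrecognised token (for example an OCR misreading such as "Gin" for "Gln") raises `KeyError`, which makes the function usable as a quick transcription check.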

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Cys Val Xaa Xaa Xaa Lys Xaa Phe Xaa Xaa Pro Xaa Xaa Xaa Xaa Gly
1 5 10 15

Gly Gly Gly Tyr Thr Xaa Arg Asn Val Ala Arg Xaa Trp Xaa Xaa Glu
20 25 30

Thr
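A general formula such as SEQ ID NO: 14 can be tested against a concrete sequence by treating every Xaa as a single-residue wildcard. The sketch below converts the one-letter form of SEQ ID NO: 14 into a regular expression and searches residues 273-320 of SEQ ID NO: 5. The helper name `formula_to_regex` and the variable names are illustrative assumptions, and a regular expression is only one simple way to do this.

```python
import re

# SEQ ID NO: 14 in one-letter code; X stands for Xaa (any single residue).
GENERAL_FORMULA = "CVXXXKXFXXPXXXXGGGGYTXRNVARXWXXET"

# Residues 273-320 of SEQ ID NO: 5, transcribed to one-letter code.
SEQ5_273_320 = "CFNLTIKGHAKCVEFVKSFNLPMLMLGGGGYTIRNVARCWTYETAVAL"
FRAGMENT_START = 273  # position of the first residue in the fragment above


def formula_to_regex(formula: str) -> re.Pattern:
    """Turn a one-letter general formula into a regex where X matches any residue."""
    return re.compile(formula.replace("X", "."))


match = formula_to_regex(GENERAL_FORMULA).search(SEQ5_273_320)
first_residue = FRAGMENT_START + match.start()
print(first_residue, match.group())
# 284 CVEFVKSFNLPMLMLGGGGYTIRNVARCWTYET
```

The match begins at Cys 284 and ends at Thr 316 of SEQ ID NO: 5, that is, the stretch Cys Val Glu Phe Val Lys ... Trp Thr Tyr Glu Thr shown in that listing.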



(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Cys Val Glu Xaa Val Lys Xaa Phe Asn Xaa Pro Leu Leu Xaa Leu Gly
1 5 10 15

Gly Gly Gly Tyr Thr Xaa Arg Asn Val Ala Arg Cys Trp Thr Tyr Glu
20 25 30

Thr

What is claimed is:

1. An isolated or recombinant HDx polypeptide.

2. The polypeptide of claim 1, of mammalian origin.

3. The polypeptide of claim 3, of human origin.

4. The polypeptide of claim 1, which polypeptide comprises an HDx polypeptide
sequence at least 88 percent homologous with SEQ ID No: 2, or fragment thereof.

5. The polypeptide of claim 1, which polypeptide comprises an HDx polypeptide
sequence at least 95 percent homologous with SEQ ID No: 2, or fragment thereof.

6. The polypeptide of claim 1, which polypeptide comprises an HDx polypeptide
sequence designated in SEQ ID No: 2.

7. The polypeptide of claim 1, which polypeptide is encoded by a nucleic acid having a
coding sequence, or portion thereof, which hybridizes under stringent conditions to
the nucleic acid designated in SEQ ID No. 1.

9. The polypeptide of claim 1, which polypeptide is an acetylase activity.

10. The polypeptide of claim 1, which polypeptide binds to an RbAp48 protein.

11. The polypeptide of claim 1, which polypeptide is a fusion protein.

12. The polypeptide of claim 1, which polypeptide has a molecular weight in the range
of 45-70 Kd.

13. An isolated or recombinant polypeptide comprising an HDx polypeptide sequence
homologous or identical to SEQ ID No. 2, or a fragment thereof which retains one or
more of (i) a histone deacetylase activity, (ii) a histone binding activity and (iii) an
RbAp48 binding activity.

14. The polypeptide of claim 13, which polypeptide comprises an HDx sequence
represented in the general formula

DXXNXGGLHHAKKXEASGFCYXNDIVXXIXELLXYHXRVXYIDXDXHHGDGXEAFYXTDRVMTXSF.

15. The polypeptide of claim 13, which polypeptide comprises an HDx sequence
represented in the general formula

CVXXXKXFXXPXXXXGGGGYTXRNVARX-WXXET.

16. The polypeptide of claim 13, which polypeptide deacetylates acetylated histones.
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17. The polypeptide of claim 13, which polypeptide is a dominant negative inhibitor 
which antagonizes deacetylation of acetylated histones. 

18. The polypeptide of claim 13, which polypeptide comprises an HDx sequence at
least 88 percent homologous with SEQ ID No: 2, or fragment thereof.

19. The polypeptide of claim 13, which polypeptide comprises an HDx sequence at
least 95 percent homologous with SEQ ID No: 2, or fragment thereof.

20. The polypeptide of claim 13, which polypeptide includes at least 25 amino acid
residues of an HDx polypeptide sequence.

21. The polypeptide of claim 13, wherein said polypeptide modulates cellular
proliferation.

22. An isolated or recombinant polypeptide comprising an HDx polypeptide sequence
represented in SEQ ID No. 2, or a fragment thereof which retains one or more of (i) a
histone deacetylase activity, (ii) a histone binding activity and (iii) an RbAp48
binding activity.

23. The polypeptide of claim 22, wherein said fusion protein includes, as a second
polypeptide sequence, a polypeptide which functions as a detectable label for
detecting the presence of said fusion protein or as a matrix-binding domain for
immobilizing said fusion protein.

24. The polypeptide of claim 13, wherein said polypeptide is a fusion protein further
comprising, in addition to said HDx sequence, a second polypeptide sequence
having an amino acid sequence unrelated to an HDx polypeptide sequence.

25. A purified or recombinant HDx polypeptide encoded by a nucleic acid which
hybridizes under stringent conditions to a nucleotide sequence designated in SEQ
ID No. 1.

26. A purified or recombinant HDx polypeptide comprising a v motif represented in the
general formula
DIAX1NWAGGLHHAKKX2EASGFCYVNDIVX3X4ILELLKYHX5RVLYIDIDIHHGDGX6EAFYX7TDRVMTVSF
and a x motif represented in
CVEX1VKX2FNX3P-X4LX5LGGGGYTX6RNVARCWTYET.

27. An isolated nucleic acid which encodes a deacetylase activity and hybridizes under
stringent conditions to a nucleotide sequence designated in SEQ ID No. 1.
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28. An isolated nucleic acid encoding an HDx polypeptide, which polypeptide 
specifically modulates histone acetylation. 

29. The nucleic acid of claim 28, which HDx polypeptide comprises a v motif
represented in the general formula
DIAX1NWAGGLHHAKKX2EASGFCYVNDIVX3X4ILELLKYHX5RVLYIDIDIHHGDGX6EAFYX7TDRVMTVSF
and a x motif represented in
CVEX1VKX2FNX3P-X4LX5LGGGGYTX6RNVARCWTYET.

30. The nucleic acid of claim 28, which HDx polypeptide comprises a polypeptide
sequence at least 88 percent homologous with SEQ ID No: 2, or fragment thereof.

31. The nucleic acid of claim 28, which HDx polypeptide comprises a polypeptide
sequence at least 95 percent homologous with SEQ ID No: 2, or fragment thereof.

32. The nucleic acid of claim 28, which HDx polypeptide comprises a polypeptide
sequence designated in SEQ ID No: 2.

33. The nucleic acid of claim 28, which HDx polypeptide has a molecular weight in the
range of 45-70 Kd.

34. The nucleic acid of claim 28, which HDx polypeptide is a fusion protein further
comprising, in addition to HDx polypeptide sequences, a second polypeptide
sequence having an amino acid sequence unrelated to a nucleic acid sequence.

35. The nucleic acid of claim 34, wherein said fusion protein includes, as a second
polypeptide sequence, a polypeptide which functions as a detectable label for
detecting the presence of said fusion protein or as a matrix-binding domain for
immobilizing said fusion protein.

36. The nucleic acid of claim 28, further comprising a transcriptional regulatory
sequence operably linked to said nucleotide sequence so as to render said nucleic
acid suitable for use as an expression vector.

37. An expression vector, capable of replicating in at least one of a prokaryotic cell and
eukaryotic cell, comprising the nucleic acid of claim 36.

38. A host cell transfected with the expression vector of claim 37 and expressing said
recombinant polypeptide.

39. A method of producing a recombinant HDx polypeptide comprising culturing the
cell of claim 38 in a cell culture medium to express said recombinant polypeptide
and isolating said recombinant polypeptide from said cell culture.
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40. A transgenic non-human animal having cells which harbor a heterologous transgene 
encoding an HDx polypeptide. 

41. A transgenic non-human animal having cells in which an HDx gene is disrupted. 

42. A recombinant transfection system, comprising 

(i) a gene construct including the nucleic acid of claim 28 and operably linked to a
transcriptional regulatory sequence for causing expression of said HDx
polypeptide in eukaryotic cells, and

(ii) a gene delivery composition for delivering said gene construct to a cell and
causing the cell to be transfected with said gene construct.

43. The recombinant transfection system of claim 42, wherein the gene delivery
composition is selected from a group consisting of a recombinant viral particle, a
liposome, and a poly-cationic nucleic acid binding agent.

44. A nucleic acid composition comprising a substantially purified oligonucleotide, said
oligonucleotide including a region of nucleotide sequence which hybridizes under
stringent conditions to at least 25 consecutive nucleotides of sense or antisense
sequence of an HDx gene.

45. The nucleic acid composition of claim 44, which oligonucleotide hybridizes under
stringent conditions to at least 50 consecutive nucleotides of sense or antisense
sequence of an HDx gene.

46. The nucleic acid composition of claim 44, wherein said oligonucleotide further
comprises a label group attached thereto and able to be detected.

47. The nucleic acid composition of claim 44, wherein said oligonucleotide has at least
one non-hydrolyzable bond between two adjacent nucleotide subunits.

48. A test kit for detecting cells which contain an HDx-encoding nucleic acid,
comprising the nucleic acid composition of claim 44 for measuring, in a sample of
cells, a level of nucleic acid encoding an HDx protein.

49. A method for modulating one or more of growth, differentiation, or survival of a
mammalian cell responsive to HDx-mediated histone deacetylation, comprising
treating the cell with an effective amount of an agent which modulates the
deacetylase activity of an HDx polypeptide thereby altering, relative to the cell in
the absence of the agent, at least one of (i) rate of growth, (ii) differentiation, or
(iii) survival of the cell.
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50. An antibody to an HDx polypeptide. 

51. The antibody of claim 50, wherein said antibody is monoclonal. 

52. A diagnostic assay for identifying a cell or cells at risk for a disorder characterized
by unwanted cell proliferation or differentiation, comprising detecting, in a cell
sample, the presence or absence of a genetic lesion characterized by at least one of
(i) aberrant modification or mutation of a gene encoding an HDx protein, and (ii)
mis-expression of said gene; wherein a wild-type form of said gene encodes an HDx
protein characterized by an ability to modulate the signal transduction activity of a
TGFβ receptor.

53. The assay of claim 52, wherein detecting said lesion includes:

i. providing a diagnostic probe comprising a nucleic acid including a region of
nucleotide sequence which hybridizes to a sense or antisense sequence of said
gene, or naturally occurring mutants thereof, or 5' or 3' flanking sequences
naturally associated with said gene;

ii. combining said probe with nucleic acid of said cell sample; and

iii. detecting, by hybridization of said probe to said cellular nucleic acid, the
existence of at least one of a deletion of one or more nucleotides from said
gene, an addition of one or more nucleotides to said gene, a substitution of
one or more nucleotides of said gene, a gross chromosomal rearrangement of
all or a portion of said gene, a gross alteration in the level of an mRNA
transcript of said gene, or a non-wild type splicing pattern of an mRNA
transcript of said gene.

54. The assay of claim 53, wherein hybridization of said probe further comprises
subjecting the probe and cellular nucleic acid to a polymerase chain reaction (PCR)
and detecting abnormalities in an amplified product.

55. The assay of claim 53, wherein hybridization of said probe further comprises
subjecting the probe and cellular nucleic acid to a ligation chain reaction (LCR) and
detecting abnormalities in an amplified product.

56. The assay of claim 53, wherein said probe hybridizes under stringent conditions to
the nucleic acid designated by SEQ ID No. 1.

57. An assay for screening test compounds to identify agents which inhibit the
deacetylation of histones comprising:

i. providing a reaction mixture including a histone deacetylase activity of an
HDx-like polypeptide, a substrate for a histone deacetylase, and a test
compound; and

ii. detecting the conversion of the substrate to product,

wherein a statistically significant decrease in the conversion of the substrate in the
presence of the test compound is indicative of a potential inhibitor of histone
deacetylation.

58. The assay of claim 57, wherein the HDx-like polypeptide is of mammalian origin.

59. The assay of claim 57, wherein the HDx-like polypeptide is an RPD3-like
deacetylase of fungal origin.

60. The assay of claim 57, wherein the reaction mixture is a reconstituted protein
mixture.

61. The assay of claim 57, wherein said reaction mixture is a cell lysate.

62. The assay of claim 57, wherein the HDx-like polypeptide is a recombinant protein.

63. An assay for screening test compounds to identify agents which inhibit histone
deacetylase interaction with cellular proteins, comprising:

i. providing a reaction mixture including an HDx-like protein, an HDx-
binding protein, and a test compound; and

ii. detecting the interaction of the HDx-like protein and the HDx binding
protein,

wherein a statistically significant decrease in the interaction of the proteins in the
presence of the test compound is indicative of a potential inhibitor of a histone
deacetylase.

64. The assay of claim 63, wherein the HDx-like protein is of mammalian origin.

65. The assay of claim 63, wherein the HDx-like polypeptide is an RPD3-like
deacetylase of fungal origin.

66. The assay of claim 63, wherein the HDx-like protein is a histone, or a portion
thereof which interacts with an HDx-like polypeptide.

67. The assay of claim 63, wherein the HDx-like protein is an RbAp48 protein, or a
portion thereof which interacts with an HDx-like polypeptide.

68. The assay of claim 63, wherein the reaction mixture is a reconstituted protein
mixture.

69. The assay of claim 63, wherein said reaction mixture is a cell lysate.

70. The assay of claim 63, wherein the HDx-like polypeptide is a recombinant protein.

71. The assay of claim 63, wherein one or both of the HDx-like protein and HDx-
binding protein is a fusion protein.

72. The assay of claim 63, wherein at least one of the HDx-like protein and HDx-
binding protein comprises an endogenous detectable label for detecting the
formation of said complex.

73. The method of claim 63, which reaction mixture is a whole cell, and interaction of
the HDx-like protein and HDx-binding protein is detected in a two hybrid assay
system.

74. A composition for inhibiting a histone deacetylase comprising a compound
represented by the general formula A-B-C, wherein

A is selected from the group consisting of cycloalkyls, unsubstituted and
substituted aryls, heterocyclyls, amino acyls, and cyclopeptides;

B is selected from the group consisting of substituted and unsubstituted
C4-C8 alkylidenes, C4-C8 alkenylidenes, C4-C8 alkynylidenes, and -(D-E-F)-, in
which D and F are, independently, absent or represent a C2-C7 alkylidene, a C2-
C7 alkenylidene or a C2-C7 alkynylidene, and E represents O, S, or NR', in which
R' represents H, a lower alkyl, a lower alkenyl, a lower alkynyl, an aralkyl, aryl, or
a heterocyclyl; and

C is selected from the group consisting of
[structural formulas, illegible in this copy]
and a boronic
acid; in which Z represents O, S, or NR5, and Y; R5 represents a hydrogen, an
alkyl, an alkoxycarbonyl, an aryloxycarbonyl, an alkylsulfonyl, an arylsulfonyl or
an aryl; R6 represents hydrogen, an alkyl, an alkenyl, an alkynyl or an aryl; and R7
represents a hydrogen, an alkyl, an aryl, an alkoxy, an aryloxy, an amino, a
hydroxylamino, an alkoxylamino or a halogen; with the proviso that the
compound is not trapoxin.
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75. A pharmaceutical preparation comprising (i) the composition of claim 74 in an 
amount effective for inhibiting proliferation of a cell, and (ii) a pharmaceutically 
acceptable diluent. 

76. A method for modulating one or more of growth, differentiation, or survival of a
mammalian cell responsive to HDx-mediated histone deacetylation, comprising
treating the cell with an effective amount of the composition of claim 74 so as to
modulate the deacetylase activity and alter, relative to the cell in the absence of the
agent, at least one of (i) the rate of growth, (ii) the differentiation state, or (iii) the
rate of survival of the cell.

77. A composition for inhibiting a histone deacetylase comprising a compound 
represented by the general formula A-B-C, wherein 

A is selected from the group consisting of cycloalkyls, unsubstituted and
substituted aryls, heterocyclyls, amino acyls, and cyclopeptides;

B is selected from the group consisting of substituted and unsubstituted
C4-C8 alkylidenes, C4-C8 alkenylidenes, C4-C8 alkynylidenes, and -(D-E-F)-, in
which D and F are, independently, absent or represent C2-C7 alkylidenes, C2-C7
alkenylidenes or C2-C7 alkynylidenes, and E represents O, S, or NR', in which R'
represents H, a lower alkyl, a lower alkenyl, a lower alkynyl, an aralkyl, an aryl,
or a heterocyclyl; and

C is selected from the group consisting of
[chemical structures not legible in this copy], in which R9 represents a hydrogen,
an alkyl, an aryl, a hydroxyl, an alkoxy, an aryloxy or an amino,

with the proviso that the inhibitor compound is not trichostatin.

78. A pharmaceutical preparation comprising (i) the composition of claim 77 in an 
amount effective for inhibiting proliferation of a cell, and (ii) a pharmaceutically
acceptable diluent. 
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79. A method for modulating one or more of growth, differentiation, or survival of a
mammalian cell responsive to HDx-mediated histone deacetylation, comprising
treating the cell with an effective amount of the composition of claim 77 so as to
modulate the deacetylase activity and alter, relative to the cell in the absence of the
agent, at least one of (i) the rate of growth, (ii) the differentiation state, or (iii) the
rate of survival of the cell.

80. A composition for inhibiting a histone deacetylase comprising a compound
represented by the general formula A-B-C, wherein

A is selected from the group consisting of cycloalkyls, unsubstituted and
substituted aryls, heterocyclyls, amino acyls, and cyclopeptides;

B is selected from the group consisting of substituted and unsubstituted
C4-C8 alkylidenes, C4-C8 alkenylidenes, C4-C8 alkynylidenes, and -(D-E-F)-, in
which D and F are, independently, absent or a C2-C7 alkylidene, a C2-C7
alkenylidene, or a C2-C7 alkynylidene, and E represents O, S, or NR', in which R'
is H, lower alkyl, lower alkenyl, lower alkynyl, aralkyl, aryl, or heterocyclyl; and

C represents [chemical structure not legible in this copy], in which Y is O or S, and R7 represents a
hydrogen, an alkyl, an aryl, an alkoxy, an aryloxy, an amino, a hydroxylamino, an
alkoxylamino or a halogen.

81. A pharmaceutical preparation comprising (i) the composition of claim 80 in an
amount effective for inhibiting proliferation of a cell, and (ii) a pharmaceutically
acceptable diluent.



82. A method for modulating one or more of growth, differentiation, or survival of a
mammalian cell responsive to HDx-mediated histone deacetylation, comprising
treating the cell with an effective amount of the composition of claim 80 so as to
modulate the deacetylase activity and alter, relative to the cell in the absence of the
agent, at least one of (i) the rate of growth, (ii) the differentiation state, or (iii) the
rate of survival of the cell.
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Figure 1A
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Figure 1B
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Figure 2A
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Figure 2B
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Figure 3B
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Figure 4B
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Figure 4D 
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Figure 5A 



Sequences

1> HD1 2> HSC11A021 3> R2ll3f
5> R18769 6> D31480 7> R98879
8> N59055

200 

1> SCTGAGGAGATGACCAAGTACCACAGCGATGACTACAT-] ' 



2 > "'-'-v.Al ''ACTACATTAAATTCTTGCGCTCCATCCGT 

6 > NCTGAGGAGATGACCAAGTAKCACAGCGATGAC "^^^'^^^^^^'^GAGAGTCAGC 

7 > 

TCCTGCAGAGAGTCAGC 

^^^°^!^f^??!=°°^«^^^*°^°CAGAXGC^^^^ 
5> •^'^TTAATGCCTTCAACGTAGGCGATGACTGC 



2> CCCACCAATATGCAAGGCTTCACCAAGAGTC- 

3TCTTAATGCCTTCAACGTAGGCGATGACTGC 



7> CCCACCAATATGCAAGGCTTCACCAAGAGT CGATGACTGC 



CCAGTGTTTCCCGGGCTCTTTGAGTTCTGC 



CTCGCGTTACACAGGCGCATCTCTGCAA< 



.GGA 



TCGCGTTACACAGGCGCATCTCTGCAAGGA 
400 

GCAACCCAGCTGAACAACAAGATCTGTGATAT 



ATGTCAACGACATTGTGTTTGGCATC 



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Figure 5A (cont.) 

5 > NATGCCANGANGTTTNAGGCCTCTGGNTTCTGCTATGTCAACGACATTGTGATTGGCATC 

7> catgccaagaagtttgaggcctctggtttctgctatgtcaacgacat^SgaSSS^c 

500 

1> CTGGAACTGCTAAAGTATCACCAGAGGGtgCTGTACATTGACATTGATATTCACCATGGT 
"^^*^^°5I^^?!*^^^^^^'^'=^*=°°T°CTCTACATTGACATTGACATCCACCATGGT 



\TTGACATCCACCA 



7 > CTGGAGCTGCTCAAGTACCACCCTCGGGT.GCTCTACATTGACAl 



550 

1> GACGGCGTGGAAGAGGcCTTCTACACCACGGACCGGGTCATGACTGTGTCCTTTCATAAr^ 
2 > GACGGGGTTCAAGAAGCTTTCTACCTCACTGACC 

5 > GACGGGGTTCAAGAAGCTTTCTACCTCACTGACCGGGTCATGACGGTGTCCTTTCCACAA 

CTACACCACGGACCGGGTCATGACTGTGTCCTTTCATAAG 

650 

1> TATGGAGAGTACTTCCCAGGAACTGGGGACCTACGGGATATCGGGGCTGGCAAAGGCAAG 
5> ATACGGGAAATTTACTTNTTCCNGGGGCACAGGTGACATGTTNTGGAAGTTCGGGGGGCA 
1 0 > TATGGAGAGTACTTCCCAGGGACTTGGGACCTACGGGATATCGGGGCTGGCAAAGGCAAG 



700 

1 > TATTATGCTGTTAACTACCCGCTCCGAGACGGGATTGATGACGAGTCCTATGAGGCCATT 

3 > TACTACTGTCTGAACGTGCCCCTGCGGATGGGCATTGATGACCAGAGTTACAAGCACCTT 
5> GGAGAGTTGGCCC 

1 0 > TATTATGCTGTTAACTACCCGCTCCGAGACGGGATTNATGACGAGTCCTATGAGGCCATT 

750 

1> TTCAAGCCGGTCATGTCCAAAGTAATGGAGATGTTCCAGCCTAGTGCGGTGGTCTTACAG 
3 > TTCCAGCCGGTTATCAACCAGGTAGTGGACTTCTACCAACCCACGTGCATTGTGCTCCAG 

CCCTATAGTGAGTCGTATTNN 
1 0 > TTCAAGCCGGTCATGTCCAAAGTAATNGAGATGTTCCAGCCTAGTGCG 

800 

1> TGTGGCTCAGACTCCCTATCTGGGGATCGGTTAGGTTGCTTCAATCTAACTATCAAAGGA 
3 > TGTGGAGCTGACTCTCTGGGCTGTGATCGATTGGGCTGCTTTAACCTCAGCATCCGAGGG 
8 > TNAAAACATGACTCACTNGGNTNNNTACGATTGGGCTGCTTTAACCTCAGCATCCGAGGG 

AGGT 



1> CACgCCAAGTGTGTGGAATTTGTCAAGAGCTTTAACCTGCCTATGCTGATGCTGGGAGGC^ 

3 > CATGGGGAATGCGTTGAATATGTCAAGAGCTTCAATATCCCTCTACTCGTGCTGGGTGGT 

4 > 

GG A 

8 > CATGGGNAATGCGTTGAATATGTCAAGAGCTTCAATATCCCTCTACTCGTGCTGGGTGGT 

9 > NATGCTAAATGTGTAGAAGTTGTAAAAACTTTTAACTTACCATTACTGATGCTTGG AGGA 
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Figure 5A (cont) 

950 

1> GGTGGTTACACCATTCGTAACGTTGCCCGGTGCTGGACATATGAGACAGCTGTGGCCCTG 

3 > GGTGGTTATACTGTCCGAAATGTTGCCCGCTGCTGGACATATGAGACATCGCTGCTGGTA 

4 > GGTGGCTACACAATCCGTAATGTTGCTCGATGTTGGACATATGAGACTGCAGTTGCCCTT 

8 > GGTGGTTATACTGTCCGAAATGTNGCCCGCTGCTGGACATATGAGACANCGCTGCTGGTA 

9 > GGTGGCTACACAATCCGTAATGTTGCTCGATGTTGGACATATGAGACTGCAGTTGCCCTT 

1000 

1 > GATACGGAGATCCCTAATGAGCTTCCATACaATGACTACTTTGAATACTTTGGACCAGAT 

3 > GAAGAGGCCATTAGTGAGGAGCTTCCCTATAGTGAATACTTCGAGTACTTTGCCCCAGAC 

4 > GATTGTGAGATTCCCAATGAGTTGCCATATAATGATTACTTTGAGTATTTTGGACCAGAC 

8 > GAAGAGGCCATTAGTGAGGAGCTTCCCTAATAGTGAATACTTCGNTACTTTGCCCCAGAC 

9 > GATTGTGAGATTCCCAATGGTAAGTGTTCTCATTACAATATCTTTATTGTATG 

1050 

1> TTCAAGCTCCACATCAGTCCTTCCAATATGACTAACCAGAACACGAATGAGTACCtGGAG 
3> TTCACACT 

4 > TTCAAACTGCATATTAGTCCTTCAAACATGACAAACCAGAACAC 

8 > TTCACACTTCATCCANATGTCAGCACCCGCATCGAGAATCCAGAACTCACGCCAGTATC 

1100 

1 > AAGATCAAACAGCGAGTGTTTGAGAACCTTAGAATGCTGCCGCACGCACCTGGGGTCCAA 
8 > NGGACCAAGATCCGCCAGACAATCTTTGNAAACCTGAAGGTTCTTNAACC 

1150 .... 1200 

1> ATGCAGGCGATTCCTGAGGACGCCATCCCTGAGGAGAGTGGCGATGAGGACGAAGACGAC 

1250 

1> CCTGACAAGCGCAXCTCGATCTGCTCCTCTGACAAACGAATTGCCTGTGAGGAAGAGTTC 

1300 

1 > TCCGATTCTGAAGAGGAGGGAGAGGGGGGCCGCAAGAACTCTTCCAACTTCAAAAAAGCC 

1350 

1 > AAGAGAGTC AAAACAGAGGATGAAAAAGAGAAAGACCCAGAGGAGAAGAAAGAAGTCACC 

1400 . 

1 > GAAGAGGAGAAAACCAAGGAGGAGAAGCCAGAAGCCAAAGGGGTCAAGGAGGAGGTCAAG 
1> TTGGCCTGA 



9> F06693 
10> H05234 
11> R21136 
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Figure 5B 



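Row labels for the alignment blocks below, in order (one line per block, rows top to bottom):

HD1, RPD3, x_rpd3
<EstA>, HD1, RPD3, x_rpd3
<EstA>, HD1, RPD3, x_rpd3
<EstB>, <EstA>, HD1, RPD3, x_rpd3
<EstC>, <EstB>, HD1, RPD3, x_rpd3
<9/4>, <3>, HD1, RPD3, x_rpd3
<EstC>, <EstB>, RPD3, x_rpd3
HD1, RPD3, x_rpd3
HD1, RPD3, x_rpd3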



^ '"^^^^^'^RR^^CYTYDGDVGhTTfYGQGHPMKPHRIRMTHNLLLNYGL^^ 

( 1) "ivyeatpfdpIWKPSDKRRVAYFTOADVGNYAYGAGHPMKPHRIRMAHSLIIWYGLYKK 
HA ^TLGTKKKVCYYYDGDVGNYYYGQGHPMKPHRIRMTHNLLLNYGLYRK 



( 1) 



IDF*LQRVSPTNMQGFTKSLNAFNVGDDCPVFPGLFEFC 
51) MEIYRPHKANAEEMTKYHSDDYIKFLRSIRPDNMSEYSKOMQRFNVGEDCPVFDGLFEFC 
( 61) MEI YRAKPATKQEMCQFHTDEYIDFLSRVTPDNLEMFKRESVKFMVGDDCPVFDGLYEYC 
( 51) MEIFRPHKASAEDMTKYHSDDYIKFLRSIRPDNMSEYSKQMQRFNVGEDCPVFDGLFEFC 

SRYTGASLQGATQLNNKICDIAINWAGGLHHAKKFEASGFCYVNDIVFGILELLKYHPRV 
( 111) OLSTGGSVASAVKLNKQQTDIAVNWAGGLHHAKKSEASGFCYVNDIVLAILELLKYHQRV 
( 121) SISGGGSMEGAARLNRGKCDVAVNYAGGLHHAKKSEASGFCYLNDIVLGIIELLRYHPRV
( 111) Qt-SAGGSVASAVKLNKOOTDISVNWSGGLHHAKKSEASGFCYVNDIVLAILELLKYHORV



( 171) 



YYCLNVPLRM 

LYIDrDIHHGDGVQEAFYLTDRVMTVSFPQlREIY 

LYIDIDIHHGDGVEEAFYTTDRVMTVSFHKYGEYFPGTGDLRDIGAGKGKYYAWNYPLRD 
( 181) LYIDIDVHHGDGVEEAFYTTDRVMTCSFHKYGEFFPGTGELRDIGVGAGKNYAXmVPLRD 
( 171) VYIDXDIHHGDGVEEAFYTTDRVMTVSFHKYGEYFPGTGDLRDIGAGKGKYYAVNYALRD 

NLLVLGHAKCVEWKT 

GIDDQSYKHLFQPVINQWDFYQPTCIVLQCGADSLGCDRLGCFNLSIRGHGECVEYVKS 
( 231) GIDDESYEAIFKPVMSKVMEMFQPSAWLQCGSDSLSGDRLGCFWLTIKGHAKCVEFVKS 
( 241) GIDDATYRSVFEPVIKKIMEWYOPSAWLQCGGDSLSGDRLGCFNLSMEGHANCVNYVKS 
( 231) GIDDESYEAIFKPVMSKVMEMFQPSAWLQCGADSLSGDRLGCFNLTIKGHAKCVEFIKT

FNLPLLMLGGGGYTIRNVARCWTYETAVALDCEIPNELPYNDYFEYFGPDFKLHISPSNM 
FNXPLLVLGGGGYTVRNVARCWTYETSLLVEEAISEELPYSEYFEYFAPDFTLHP 
( 291) FNLPMLMLGGGGYTIRNVARCWTYETAVALDTEIPNELPYNDYFRYFGPDFKLHISPSNM 
( 301) FGIPMMWGGGGYTMRNVARTWCFETGLLN^m;LDKX>LPYNEYYEYYGPDYKLSVRPS^^^ 
( 291) FNLPLLMLGGGGYTIRI^ARCWTYETAVALDSEIPNELPYNDYFEYFGPDFKLHISPSNM 

TNQN 

( 351) TNQNTNEYLEKIKQRLFENLRMLPHAPGVQMQAIPEDAIPEESGDEDEDDPDKRISICSS 
( 361) FNVNTPEYLDKVMTNIFANLENTKYAPSVOLNHTPRDaedlgdveedsaeakdt)cggsqy 
( 351) TNQNTNEYLEKIKQRLFENLRMLPHAPGVQMQAVAEDSIHDDSGEEDEDDPDKRISIRSS 

( 411) DKRIACEEEFSDSEEEGEGGRKNSSNFKKAKRVKTEDERE)cdPEEKKEVTEEEKTKEEKP 
( 421) ardlhvehdnef y 

( 411) DKRIACDEEFSDSEDEGEGGRKNVANFKKVKRVKTEEEKE--GEDKKDVICEEEKAKDEKT 

( 471) EAKGVKEEVRla

( 434) 

( 469) DSKRVKEETKsv
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Figure 7 
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Figure 8A 
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Figure 8B 




WO 97/35990    PCT/US97/05275

Figure 8C
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Figure 9A 
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Figure 9B 
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Figure 9C 
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Figure 10A
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Figure 11A

Figure 11B
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Figure 11C
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Figure 12A 
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